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METHODS OF DIAGNOSIS AND PROGNOSIS OF OVARIAN CANCER 



Field of the invention 

The present invention relates to the identification of nucleic acid and protein expression 
5 profiles and nucleic acids, products, and antibodies thereto that are involved in ovarian 
cancer, and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of ovarian cancer. More particularly, this invention relates to 
novel genes that are expressed at elevated or reduced levels in malignant tissues and 
uses therefor in the diagnosis of cancer or malignant tumors in human subjects. This 

10 invention also relates to the use of nucleic acid or antibody probes to specifically detect 
ovarian cancer cells, such as, for example, in the ovarian surface epithelium, wherein 
over-expression or reduced expression of nucleic acids hybridizing to the probes is highly 
associated with the occurrence and/or recurrence of an ovarian tumor, and/or the 
likelihood of patient survival. The diagnostic and prognostic test of the present invention 

15 is particularly useful for the early detection of ovarian cancer or metastases thereof, or 
other cancers, and for monitoring the progress of disease, such as, for example, during 
remission or following surgery or chemotherapy. The present invention is also directed 
to methods of therapy wherein the activity of a protein encoded by a 
diagnostic/prognostic gene described herein is modulated. 

20 

Background of the invention 
1. General 

As used herein the term "derived from" shall be taken to indicate that a specified integer 
are obtained from a particular source albeit not necessarily directly from that source. 

25 

Unless the context requires otherwise or specifically stated to the contrary, integers, 
steps, or elements of the invention recited herein as singular integers, steps or elements 
clearly encompass both singular and plural forms of the recited Integers, steps or 
elements. 

30 

The embodiments of the invention described herein with respect to any single 
embodiment shall be taken to apply mutatis mutandis to any other embodiment of the 
invention described herein. 
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Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to imply 
the inclusion of a stated step or element or integer or group of steps or elements or 
integers but not the exclusion of any other step or element or integer or group of 
elements or integers. 

Those skilled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described. It is to be 
understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations or any two or more of said steps or features. 

The present invention is not to be limited in scope by the specific examples described 
herein. Functionally equivalent products, compositions and methods are clearly within the 
scope of the invention, as described herein. 

The present invention is performed without undue experimentation using, unless 
otherwise indicated, conventional techniques of molecular biology, microbiology, virology, 
recombining DNA technology, peptide synthesis in solution, solid phase peptide 
synthesis, and immunology. Such procedures are described, for example, in the 
following texts that are incorporated herein by reference: 

1. Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, New York, Second Edition (1989), whole of Vols I, II, 
and III; 

2. DNA Cloning: A Practical Approach, Vols. I and II (D. N. Glover, ed., 1985), IRL 
Press, Oxford, whole of text; 

3. Oligonucleotide Synthesis: A Practical Approach (M. J. Gait, ed., 1984) IRL 
Press, Oxford, whole of text, and particularly the papers therein by Gait, pp1-22; 
Atkinson et a/., pp35-81; Sproat et aL, pp 83-1 15; and Wu et aL, pp 135-151; 

4. Nucleic Acid Hybridization: A Practical Approach (B. D. Hames & S. J. Higgins, 
eds., 1985) IRL Press, Oxford, whole of text; 

5. Perbal, B., A Practical Guide to Molecular Cloning (1984); 

6. Wunsch, E., ed. (1974) Synthese von Peptiden in Houben-Weyls Metoden der 
Organischen Chemie (Muler, E., ed.), vol. 15, 4th edn., Parts 1 and 2, Thieme, 
Stuttgart. 
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7. Handbook of Experimental Immunology, Vols. I-IV (D. M. Weir and C. C. 
Blackwell, eds., 1986, Blackwell Scientific Publications). 

This specification contains nucleotide and amino acid sequence information prepared 
using Patentln Version 3.1, presented herein after the claims. Each nucleotide sequence 
is identified in the sequence listing by the numeric indicator <210> followed by the 
sequence identifier (e.g. <210>1, <210>2, <210>3, etc). The length and type of 
sequence (DNA, protein (PRT), etc), and source organism for each nucleotide sequence, 
are indicated by information provided in the numeric indicator fields <211>, <212> and 
<213>, respectively. Nucleotide sequences referred to in the specification are defined by 
the term "SEQ ID NO:", followed by the sequence identifier (eg. SEQ ID NO: 1 refers to 
the sequence in the sequence listing designated as <400>1). 

The designation of nucleotide residues referred to herein are those recommended by the 
IUPAC-IUB Biochemical Nomenclature Commission, wherein A represents Adenine, C 
represents Cytosine, G represents Guanine, T represents thymine, Y represents a 
pyrimidine residue, R represents a purine residue, M represents Adenine or Cytosine, K 
represents Guanine or Thymine, S represents Guanine or Cytosine, W represents 
Adenine or Thymine, H represents a nucleotide other than Guanine, B represents a 
nucleotide other than Adenine, V represents a nucleotide other than Thymine, D 
represents a nucleotide other than Cytosine and N represents any nucleotide residue. 

2. Description of the related art 

Cancer is a multi-factorial disease and major cause of morbidity in humans and other 
animals, and deaths resulting from cancer in humans are increasing and expected to 
surpass deaths from heart disease in future. Carcinomas of the lung, prostate, breast, 
colon, pancreas, and ovary are major contributing factors to total cancer death in 
humans. For example, prostate cancer is the fourth most prevalent cancer and the 
second leading cause of cancer death in males. Similarly, cancer of the ovary is the 
second most common cancer of the female reproductive organs and the fourth most 
common cause of cancer death among females. With few exceptions, metastatic 
disease from carcinoma is fatal. Even if patients survive their primary cancers, 
recurrence or metastases are common. 

It is widely recognized that simple and rapid tests for solid cancers or tumors have 
considerable clinical potential. Not only can such tests be used for the early diagnosis of 
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cancer but they also allow the detection of tumor recurrence following surgery and 
chemotherapy. A number of cancer-specific blood tests have been developed which 
depend upon the detection of tumor-specific antigens in the circulation (Catalona, W.J., 
et a/., 1991, "Measurement of prostate-specific antigen in serum as a screening test for 
5 prostate cancer", N. Engl. J. Med. 324, 1156-1161; Barrenetxea, G., et a/., 1998, "Use of 
serum tumor markers for the diagnosis and follow-up of breast cancer", Oncology, 55, 
447-449; Cairns, P., and Sidreansky, D., 1999, "Molecular methods for the diagnosis of 
cancer". Biochim. Biophys. Acta. 1423, C 11-C 18). 

10 Ovarian cancer is the fourth most frequent cause of cancer death in females and in the 
United States, and accounts for approximately 13,000 deaths annually. Furthermore, 
ovarian cancer remains the number one killer of women with gynaecological malignant 
hyperplasia and the incidence is rising in industrialized countries. The etiology of the 
neoplastic transformation remains unknown although there is epidemiological evidence 

15 for an association with disordered endocrine function. The incidence of ovarian 
carcinoma is higher in nulliparous females and in those with early menopause. 

Most ovarian cancers are thought to arise from the ovarian surface of epithelium (OSE). 
Epithelial ovarian cancer is seldom encountered in women less than 35 years of age; Its 

20 incidence increases sharply with advancing age and peaks at ages 75 to 80, with the 
median age being 60 years. The single most important known risk factor is a strong 
familial history of breast or ovarian cancer. To date, little is known about the structure 
and function of the OSE cells. It is known that the OSE is highly dynamic tissue that 
undergoes morphogenic changes, and has proliferative properties sufficient to cover the 

25 ovulatory site following ovulation. Morphological and histochemical studies suggest that 
the OSE has secretory, endocytotic and transport functions which are hormonally- 
controlled (Blaustein and Lee, Oncol 8, 34-43, 1979; Nicosia and Johnson, Int. J. 
Gynecol. Pathol, 3, 249-260, 1983; Papadaki and Beilby, J. Cell Scl 8, 445-464, 1971; 
Anderson et a/., J. Morphol, 750, 135-164, 1976). 

30 

Ovarian cancers are not readily detectable by diagnostic techniques (Siemens et al, J. 
Cell Physiol, 134: 347-356, 1988). In fact, the diagnosis of carcinoma of the ovary is 
generally only possible when the disease has progressed to a late stage of development. 
Approximately 75% of women diagnosed with ovarian cancer are already at an advanced 
35 stage (III and IV) of the disease at their initial diagnosis. During the past 20 years, neither 
diagnosis nor five year survival rates have greatly improved for these patients. This is 
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substantially due to the high percentage of high-stage initial detection of the disease. 
There is therefore a need to develop new markers that improve early diagnosis and 
thereby reduce the percentage of high-stage initial diagnoses. 

5 A number of proteinaceous ovarian tumor markers were evaluated several years ago, 
however these were found to be non-specific; and determined to be of low value as 
markers for primary ovarian cancer (Kudlacek et a/., Gyn. One. 35, 323-329, 1989; 
Rustin etal. t J. Clin. One, 7, 1667-1671, 1989; Sevelda etal.,Am. J. Obstet Gynecol., 
161, 1213-1216, 1989; Omar et a/., Tumor Biol., 10, 316-323, 1989). Several 

10 monoclonal antibodies were also shown to react with ovarian tumor associated antigens, 
however they were not specific for ovarian cancer and merely recognize determinants 
associated with high molecular weight mucin-like glycoproteins (Kenemans et a/., Eur. J. 
Obstet Gynecol. Repod. Biol. 29, 207-218, 1989; McDuffy, Ann. Clin. Biochem., 26, 379- 
387, 1989). More recently, oncogenes associated with ovarian cancers have been 

15 identified, including HER-2/neu {c-erbB-2) which is over-expressed in one-third of ovarian 
cancers (USSN 6,075,122 by Cheever et al, issued June 13, 2000), the fms oncogene, 
and abnormalities in the p53 gene, which are seen in about half of ovarian cancers. 

Whilst previously identified markers for carcinomas of the ovary have facilitated efforts to 
20 diagnose and treat these serious diseases, there is a clear need for the identification of 
additional markers and therapeutic targets. The identification of tumor markers that are 
amenable to the early-stage detection of localized tumors is critical for more effective 
management of carcinomas of the ovary. 

25 Summary of the invention 

In work leading up to the present invention, the inventors sought to identify nucleic acid 
markers that were diagnostic of ovarian cancers generally, or diagnostic of specifc 
ovarian cancers such as, for example, serous ovarian cancer (SOC), mucinous ovarian 
cancer (MOC), non-invasive (borderline ovarian cancer or low malignant potential 

30 ovarian cancer), mixed phenotype ovarian cancer, endometrioid ovarian cancer (EnOC) 
and clear cell ovarian cancer (CICA), papillary serous ovarian cancer, Brenner cell or 
undifferentiated adenocarcinoma, by virtue of their modulated expression in cancer 
tissues derived from a patient cohort compared to their expression in healthy or non- 
cancerous cells and tissues. Additionally, the inventors sought to determine whether any 

35 correlation exists between the expression of any particular gene in a subject having 
ovarian cancer and the survival, or likelihood for survival, of the subject during the 
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medium to long term (i.e. in the period between about 1-2 years from primary diagnosis, 
or longer). The inventors also sought to to determine whether any correlation exists 
between the expression of any particular gene in a subject following treatment for ovarian 
cancer and the recurrence, or likelihood for recurrence, of ovarian cancer in the subject 
during the medium to long term (i.e. in the period between about 1-2 years from primary 
diagnosis, or longer). 

As exemplified herein, the inventors identified a number of genes whose expression is 
altered (up-regulated or down-regulated) in individuals with ovarian cancer compared to 
healthy individuals., eg., subjects who do not have ovarian cancer. The particular genes 
are identified in Tables 1 and 2. Preferably, the genes are selected from the group of 
candidate genes set forth in Table 3. 

The list of genes and proteins exemplified herein by Table 1 were identified by a 
statistical analysis as outlined in the examples which gave a P-value, eg., by comparison 
of expression to the expression of that gene in normal ovaries. 

Accordingly, one aspect of the present invention provides a method of detecting an 
ovarian cancer-associated transcript in a biological sample, the method comprising 
contacting the biological sample with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Table 1 or 2 or 3. Preferably 
the percentage identity to a sequence disclosed in any one of Tables 1-3 is at least 
about 85% or 90% or 95%, and still more preferably at least about 98% or 99%. 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a modified level of hybridization of the probe for the subject being 
tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of: 
(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 
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(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 

' 5 69, 71, 73, 75, 77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

10 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82 and 84; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 

15 (ii) or (iii) or (iv). 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
20 for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a modified level of hybridization of the probe for the subject being 
tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of: 
25 (i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
30 to at least about 20 contiguous nucleotides from a sequence selected from the 

group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
35 group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
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29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71,73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 

5 34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 

78, 80, 82 and 84; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

10 Even more preferably, the present invention provides a method of diagnosing an ovarian 
cancer in a human or animal subject being tested said method comprising contacting a 
biological sample from said subject being tested with a nucleic acid probe for a time and . 
under conditions sufficient for hybridization to occur and then detecting the hybridization 
wherein a modified level of hybridization of the probe for the subject being tested 

15 compared to the hybridization obtained for a control subject not having ovarian cancer 
indicates that the subject being tested has an ovarian cancer, and wherein said nucleic 
acid probe comprises a sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 

20 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 

67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

25 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

30 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84; and 

35 (v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 



WO 2004/022778 PCT/AU2003/001 166 

9 

As used herein, the term "modified level" includes an enhanced, increased or elevated 
level of an integer being assayed, or alternatively, a reduced or decreased level of an 
integer being assayed. 

In one embodiment an elevated, enhanced or. increased level of expression of the nucleic 
acid is detected. In accordance with this embodiment, the present invention provides a 
method of diagnosing an ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
then detecting the hybridization wherein an enhanced level of hybridization of the probe 
for the subject being tested compared to the hybridization obtained for a control subject 
not having ovarian cancer indicates that the subject being tested has an ovarian ovarian 
cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 or 2 other than a nucleic acid having an Accession 
Number selected from the group consisting of NM_022117, NM_005460, 
NMJ)02387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NM_022117, NM_005460, NM_002387, AI745249 and 
AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NM_022117, NM_005460, NM_002387, AI745249 and 
AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein an enhanced level of hybridization of the probe for the subject 
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being tested compared to the hybridization obtained for a control subject not having 
ovarian cancer indicates that the subject being tested has an ovarian cancer, and 
wherein said nucleic acid probe comprises a sequence selected from the group 
consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21 , 23, 
25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53,. 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; 
and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

In an alternative preferred embodiment, a reduced level of a diagnostic marker is 
indicative of ovarian cancer. In accordance with this embodiment, the present invention 
provides a method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a reduced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 
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(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of NMJJ22117, NMJ)05460, NM_002387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
5 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

and having an Accession Number selected from the group consisting of 
NMJ322117, NM_005460, NM_002387, AI745249 and AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
10 Table 1 and having an Accession Number selected from the group consisting of 

NM_0221 1 7, NM_005460, NM_002387, AI745249 and AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15 In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with a nucleic acid probe 
for a time and under conditions sufficient for hybridization to occur and then detecting the 
hybridization wherein a reduced level of hybridization of the probe for the subject being 
20 tested compared to the hybridization obtained for a control subject not having ovarian 
cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
nucleic acid probe comprises a sequence selected from the group consisting of: 
(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, and 5; 
25 (ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1 , 3, and 5; 
(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, and 5; 
30 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 



35 Preferably, the ovarian cancer that is diagnosed according to the present invention is an 
epithelial ovarian cancer, such as, for example, serous ovarian cancer, non-invasive 
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ovarian cancer, mixed phenotpye ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell or undifferentiated adenocarcinoma. As will be apparent from the preferred 
embodiments described below, certain of the genes represented in Table 1, Table 2 and 
5 Table 3 are expressed at modified levels in subjects having serous or mucinous ovarian 
cancers. Data presented in Figures 1-4 also exemplifiy novel diagnostics and 
prognostics for serous ovarian cancer,, mucinous ovarian cancer, endometrioid ovarian 
cancer and clear cell ovarian cancer. 

10 As exemplified herein by Table 2, the present inventors have identified those genes 
having an elevated or reduced average ratio of expression of specific genes between 
ovarian cancer patients vs non-ovarian cancer patients, wherein a high ratio in Table 2 
indicates an enhanced expression in an ovarian cancer patients and wherein a negative 
ratio indicates that a reduced expression in an ovarian cancer patient. 

In an alternative preferred embodiment, the present invention provides a method of 
diagnosing a serous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
20 then detecting the hybridization wherein a modified level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a serous ovarian 
cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

25 (i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 2 or as set forth in Table 1 and having an Accession 
Number selected from the group consisting of: U62801, D49441, X51630, and 
AB018305; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
30 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 2 

or as set forth in Table 1 and having an Accession Number selected from the 
group consisting of: U62801, D49441, X51630, And AB018305; 
(Hi) a sequence that is at least about 80% Identical to (i) or (ii); 
(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
35 Table 2 or as set forth in Table 1 and having an Accession Number selected from 

the group consisting of: U62801, D49441, X51630, And AB018305; and 
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(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

In a further alternative preferred embodiment, the present invention provides a method of 
5 diagnosing a mucinous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with a 
nucleic acid probe for a time and under conditions sufficient for hybridization to occur and 
then detecting the hybridization wherein an elevated level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
10 having ovarian cancer indicates that the subject being tested has a mucinous ovarian 
cancer, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 

15 consisting of: NM_006149, AA315933, U47732, NM_005588, AW503395, 

NMJ)04063, AI073913, AI928445, NM_022454, W40460, AA1 32961 and 
AF111856; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

20 and having an Accession Number selected from the group consisting of: 

NMJ306149, AA315933, U47732, NM_005588, AW503395, NM_004063, 
AI073913, AI928445, NM_022454, W40460, AA1 32961 and AF111856; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting of: 

NM_006149, AA315933, U47732, NMJJ05588, AW503395, NM_004063, 
AI073913, AI928445, NMJ322454, W40460, AA1 32961 and AF1 11856; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

30 

In a preferred embodiment, the present invention provides a method of diagnosing a 
mucinous ovarian cancer in a human or animal subject being tested said method 
comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
35 detecting the hybridization wherein an enhanced level of hybridization of the probe for 
the subject being tested compared to the hybridization obtained for a control subject not 
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having ovarian cancer indicates that the subject being tested has an ovarian cancer, and 
wherein said nucleic acid probe comprises a sequence selected from the group 
consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from SEQ ID NO: 
5 57 or 59 or 61; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from SEQ ID NO: 57 or 59 or 61 ; 

(iii) a sequence that is at least about 80% identical to SEQ ID NO: 57 or 59 or 61 ; 

(iv) a sequence that encodes the amino acid sequence set forth in SEQ ID NO: 58 or 
10 60 or 62; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

Those skilled in the art will be aware that as a carcinoma progresses, metastases occur 
15 . in organs and tissues outside the site of the primary tumor. For example, in the case of 
ovarian cancer, metastases commonly appear in a tissue selected from the group 
consisting of omentum, abdominal fluid, lymph nodes, lung, liver, brain, and bone. 
Accordingly, the term "ovarian cancer" as used herein shall be taken to include an early 
or developed tumor of the ovary, such as, for example, any one or more of a number of 
20 cancers of epithelial origin, such as serous, mucinous, endometrioid, clear cell, papillary 
serous, Brenner cell or undifferentiated adenocarcinoma, non-invasive ovarian cancer 
such as borderline ovarian cancer or low-malignant potential ovarian cancer, or a mixed 
phenotype ovarian cancer, and optionally, any metastases outside the ovary that occurs . 
in a subject having a primary tumor of the ovary. 

25 

As used herein, the term "diagnosis", and variants thereof, such as, but not limited to 
"diagnose", "diagnosed" or "diagnosing" shall not be limited to a primary diagnosis of a 
clinical state, however should be taken to include any primary diagnosis or prognosis of a 
clinical state. For example, the "diagnostic assay" formats described herein are equally 
30 relevant to assessing the remission of a patient, or monitoring disease recurrence, or 
tumor recurrence, such as following surgery or chemotherapy, or determining the 
appearance of metastases of a primary tumor. All such uses of the assays described 
herein are encompassed by the present invention. 



35 Both classical hybridization and amplification formats, and combinations thereof, are 
encompassed by the invention. In one embodiment, the hybridization comprises 
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performing a nucleic acid hybridization reaction between a labeled probe and a second 
nucleic acid in the biological sample from the subject being tested, and detecting the 
label. In another embodiment, the hybridization comprising performing a nucleic acid 
amplification reaction eg., polymerase chain reaction (PCR), wherein the probe consists 
5 of a nucleic acid primer and nucleic acid copies of the nucleic acid in the biological 
sample are amplified. As will be known to the skilled artisan, amplification may proceed 
classical nucleic acid hybridization detection systems, to enhance specificity of detection, 
particularly in the case of less abundant mRNA species in the sample. 

10 In a preferred embodiment, the polynucleotide is immobilised on a solid surface. 

The present invention clearly encompasses nucleic acid-based methods and protein- 
based methods for diagnosing cancer in humans and other mammals. 

15 Accordingly, in a related embodiment, the present invention provides a method of 
detecting an ovarian cancer-associated polypeptide in a biological sample the method 
comprising contacting the biological sample with an antibody that binds specifically to an 
ovarian cancer-associated polypeptide in the biological sample, the polypeptide being 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% 

20 identical to a sequence as shown in Tables 1-3. 

Preferably the percentage identity to a sequence disclosed in any one of Tables 1-3 is at 
least about 85% or 90% or 95%, arid still more preferably at least about 98% or 99%. 

25 In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a modified level of the antigen-antibody complex for the 

30 subject being tested compared to the amount of the antigen-antibody complex formed for 
a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 

35 consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
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36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84. 

In a preferred embodiment, the present invention provides a method of diagnosing an 
5 ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a modified level of the antigen-antibody complex for the 
subject being tested compared to the amount of the antigen-antibody complex formed for 

10 a control subject not having ovarian cancer indicates that the subject being tested has ah 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 

15 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

In one embodiment an elevated, enhanced or increased level of expression of the 
antigen-antibody complex is detected. In accordance with this embodiment, the present 
invention provides a method of diagnosing an ovarian cancer in a human or animal 

20 subject being tested said method comprising contacting a biological sample from said 
subject being tested with an antibody for a time and under conditions sufficient for an 
antigen-antibody complex to form and then detecting the complex wherein an enhanced 
level of the antigen-antibody complex for the subject being tested compared to the 
amount of the antigen-antibody complex formed for a control subject not having ovarian 

25 cancer indicates that the subject being tested has an ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NM J)221 17, NM_005460, NMJ502387, AI745249 and AI694200. 

30 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a time 
and under conditions sufficient for an antigen-antibody complex to form and then 
35 detecting the complex wherein an enhanced level of the antigen-antibody complex for the 
subject being tested compared to the amount of the antigen-antibody complex formed for 
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a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

In an alternative preferred embodiment, a reduced level of a diagnostic marker is 
indicative of ovarian cancer. In accordance with this embodiment, the present invention 
provides a method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a reduced level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
the subject being tested has an ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 
having an Accession Number selected from the group consisting of NMJ522117, 
NM_005460, NMJ302387, AI745249 and AI694200. 

In a preferred embodiment, the present invention provides a method of diagnosing an 
ovarian cancer in a human or animal subject being tested said method comprising 
contacting a biological sample from said subject being tested with an antibody for a tome 
and under conditions sufficient for an antigen-antibody complex to form and then 
detecting the complex wherein a reduced level of the antigen-antibody complex for the 
subject being tested compared to the amount of the antigen-antibody complex formed for 
a control subject not having ovarian cancer indicates that the subject being tested has an 
ovarian cancer, and wherein said antibody binds to a polypeptide comprising an amino 
acid sequence comprising at least about 10 contiguous amino acid residues of a 
sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6. 

Preferably, the ovarian cancer that is diagnosed according to the present invention is an 
epithelial ovarian cancer, such as, for example, serous ovarian cancer or mucinous 
ovarian cancer. 
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In an alternative preferred embodiment, the present invention provides a method of 
diagnosing a serous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with an 
5 antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein a modified level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a serous ovarian cancer, and wherein said antibody binds to a 
10 polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 2 or as 
set forth in Table 1 and having an Accession Number selected from the group consisting 
of: U62801 , D49441 , X51 630, And AB01 8305. 

15 In a further alternative preferred embodiment, the present invention provides a method of 
diagnosing a mucinous ovarian cancer in a human or animal subject being tested said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein a reduced level of the antigen-antibody 

20 complex for the subject being tested compared to the amount of the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a mucinous ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 

25 having an Accession Number selected from the group consisting of: NM_006149, 
AA315933, U47732, NM_005588, AW503395, NM_004063, AI073913, AI928445, 
NMJ)22454, W40460, AA1 32961 and AF111856. 

In a preferred embodiment, the present invention provides a method of diagnosing a 
30 mucinous ovarian cancer in a human or animal subject being tested said method 
comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein an enhanced level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
35 complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has a mucinous ovarian cancer, and wherein said antibody binds to a 
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polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to SEQ ID NO: 58 
or 60 or 62. 

5 In a further related embodiment, the present invention provides a method of detecting an 
ovarian cancer-associated antibody in a biological sample the method comprising 
contacting the biological sample with a polypeptide encoded by a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1-3, wherein the polypeptide specifically binds to the ovarian cancer-associated 
10 antibody. 

Preferably, in the above methods, the biological sample is contacted with a plurality of 
the polynucleotides, polypeptides or antibodies referred to above. 

15 In a particularly preferred embodiment, the present invention provides an antibody-based 
mulptiptex assay for determing the likelihood of survivial of a subject from an ovarian 
cancer. In one embodiment, the invention provides a method of determining the 
likelihood of survival of a subject suffering from a serous ovarian cancer, said method 
comprising contacting a biological sample from said subject being tested with at least two 

20 antibodies for a time and under conditions sufficient for antigen-antibody complexes to 
form and then detecting the complexes wherein an enhanced level of the antigen- 
antibody complexes for the subject being tested compared to the amount of the antigen- 
antibody complexes formed for a control subject not having ovarian cancer indicates that 
the subject being tested has a poor probability of survival, and wherein one antibody 

25 binds to an sFRP polypeptide comprising the amino acid sequence set forth in SEQ ID 
NO: 72 and wherein one antibody binds to a SOCS3 polypeptide comprising the amino 
acid sequence set forth in SEQ ID NO: 74. 

The present invention is not to be limited by the source or nature of the biological 
30 sample. In one embodiment, the biological sample is from a patient undergoing a 
therapeutic regimen to treat ovarian cancer. In an alternative preferred embodiment, the 
biological sample is from a patient suspected of having ovarian cancer. 

In addition to providing up-regulated and down-regulated genes, the list of genes and 
35 proteins exemplified herein by Table 1 were identified by a statistical analysis as outlined 
in the examples which gave a P-value, eg., by comparison of expression to 
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clinicopathological parameters for disease recurrence or patient survival. Accordingly, 
the present invention is particularly useful for prognostic applications, in particular for 
assessing the medium-to-long term survival of a subject having an ovarian cancer, or 
alternatively or in addition, for assessing the likelihood of disease recurrence. 

Accordingly, a further aspect of the present invention provides a method of monitoring 
the efficacy of a therapeutic treatment of ovarian cancer, the method comprising: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated transcript in the 
biological sample by contacting the biological sample with a polynucleotide 
that selectively hybridizes to a sequence having at least about 80% identity 
to a sequence as shown in any one of Tables 1-3, thereby monitoring the 
efficacy of the therapy. 

Preferably the method further comprises comparing the level of the ovarian 
cancer-associated transcript to a level of the ovarian cancer-associated transcript in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment 

In a related embodiment, the present invention provides a method of monitoring the 
efficacy of a therapeutic treatment of ovarian cancer, the method comprising : 

(i) providing a biological sample from a patient undergoing the. therapeutic . 
treatment; and 

(ii) determining the level of a ovarian cancer-associated antibody in the 
biological sample by contacting the biological sample with a polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1-3, wherein the 
polypeptide specifically binds to the ovarian cancer-associated antibody, 
thereby monitoring the efficacy of the therapy. 

Preferably the method further comprises comparing the level of the ovarian 
cancer-associated antibody to a level of the ovarian cancer-associated antibody in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment 
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In a further related embodiment, the present invention provides a method of monitoring 
the efficacy of a therapeutic treatment of ovarian cancer, the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 

treatment; and 

5 (ii) determining the level of a ovarian cancer-associated polypeptide in the 

biological sample by contacting the biological sample with an antibody, wherein 
the antibody specifically binds to a polypeptide encoded by a polynucleotide that 
selectively hybridizes to a sequence at least 80% identical to a sequence as 
shown in Tables 1-3, thereby monitoring the efficacy of the therapy. 

10 

Preferably the method further comprises comparing the level of the ovarian 
cancer-associated polypeptide to a level of the ovarian cancer-associated polypeptide in 
a biological sample from the patient prior to, or earlier in, the therapeutic treatment 

15 It will also be apparent from the following preferred embodiments, that the expression of 
certain genes listed in Table 1 and Table 3 is statistically correlated with survival and 
death of patients having ovarian cancer, wherein a low P value indicates an enhanced 
likelihood that a patient having altered expression of the gene will die from the cancer. 

Accordingly, in one embodiment, the present invention provides a method of determining 
the likelihood of survival of a subject suffering from an ovarian cancer, said method 
cpmprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 
subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: NM_003014, AA046217, NMJ)15902, T83882, AB040888, 
AA628980, AI623351, AW614420, AA243499, AF251237, AI970797, AF145713, 
X78565, T97307, BE243845, AW068302, AL1 33561, BE313555, X07820, 
AI973016, AF084545, U41518, Z11894, AW138190, BE086548, W47196, 
AI796870, X02761, AW968613, AW972565, AF045229, AW953853, U52426, 
F06700, AI798863, H52761, BE546947, AU076643, U20536, AA581602, 
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AJ245210, X65965, AI806770, BE386490, AW581992, U77534, AL034417, 
L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 
BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NM_014992, 
BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
5 BE614567, AL079658, NM_002776, BE261944, NM_006379, AI002238, X81789, 

NM_002122, AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
N72403, BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, 
NM_001 955, AI680737, AI752666, AA505445, BE246649, and NM_003955; 
(ii) a sequence that hybridizes under at least low stringency hybridization conditions 

10 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, 
BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 

15 U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 

AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863, 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 
AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944, 
W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 

20 M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 

R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122. 
AB001914. AA311919, AI381750, AA292998. BE439580, AI677897, N72403, 
BE003054, AL035588, AI080491, AW770994, H24177, AF146761, NM_001955, 

25 AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(Hi) a sequence that is at least about 80% identical to (i) or (ii); 
(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 

30 AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307. 

BE243845, AW068302, AL1 33561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 
AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863, 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 

35 AI806770, BE386490, AW581992, U77534. AL034417, L10343, AW518944, 

W28729, AI640160, U11862. AW295980, X59135, BE466173, AI354722, 
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M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 
R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NMJ)02776, BE261944, NMJJ06379, AI002238, X81789, NM_002122, 
AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
5 BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, NMJ)01955, 

AI680737, AI752666, AA505445, BE246649, and NMJJ03955; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

10 In a preferred embodiment, the present invention provides a method of determining the 
likelihood of survival of a subject suffering from an ovarian cancer, said method 
comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 

15 subject being tested compared to the hybridization obtained for a control subject not 
having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
20 selected from the group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 

79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

25 (iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

30 

In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from an ovarian cancer, said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
35 form and then detecting the complex wherein an enhanced level of the antigen-antibody 
complex for the subject being tested compared to the amount of the antigen-antibody 
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complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has has a poor probability of survival, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence encoded by a nucleic acid set forth in Table 1 and 
5 having an Accession Number selected from the group consisting of: NM_003014, 
AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, AW614420, 
AA243499, AF251237, AI970797, AF145713, X78565, T97307, BE243845, AW068302, 
AL1 33561, BE313555, X07820, AI973016, AF084545, U41518, Z11894, AW138190, 
BE086548, W47196, AI796870, X02761, AW968613, AW972565, AF045229, 

10 AW953853, U52426, F06700, AI798863, H52761, BE546947, AU076643, U20536, 
AA581602, AJ245210, X65965, AI806770, BE386490, AW581992, U77534, AL034417, 
L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, BE466173, 
AI354722, M90464, AA829286, AI333771, BE465867, NMJ314992, BE616902, 
AA430373, R27430, BE387335, AW264102, AW952323, AA088177, BE614567, 

15 AL079658, NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122 ( 
AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
BE003054, AL035588, AI080491, AW770994, H24177, AF146761, NM_001955, 
AI680737, AI752666, AA505445, BE246649, and NM_003955. 

20 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from an ovarian cancer, said 
method comprising contacting a biological sample from said subject being tested with an 
antibody for a time and under conditions sufficient for an antigen-antibody complex to 
form and then detecting the complex wherein an enhanced level of the antigen-antibody 

25 complex for the subject being tested compared to the amount of. the antigen-antibody 
complex formed for a control subject not having ovarian cancer indicates that the subject 
being tested has has a poor probability of survival, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to a sequence 

30 selected from the group consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 
and 84. 

In a particularly preferred embodiment, the present invention provides a marker for 
determining the likelihood of a subject surviving from serous cancer. In accordance with 
35 this embodiment of the invention, there is provided a method of determining the 
likelihood of survival of a subject suffering from a serous ovarian cancer, said method 
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comprising contacting a biological sample from said subject being tested with a nucleic 
acid probe for a time and under conditions sufficient for hybridization to occur and then 
detecting the hybridization wherein an elevated level of hybridization of the probe for the 
subject being tested compared to the hybridization obtained for a control subject not 
5 having ovarian cancer indicates that the subject being tested has a poor probability of 
survival, and wherein said nucleic acid probe comprises a sequence selected from the 
group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 

acid comprising the nucleotide sequence set forth in SEQ ID NO: 71 or 73; 
10 (ii) a sequence that hybridizes under at least low stringency hybridization conditions 

to at least about 20 contiguous nucleotides from a nucleic acid comprising the 

nucleotide sequence set forth in SEQ ID NO: 71 or 73; 
(iii) a sequence that is at least about 80% identical to (i) or (ii) and encoding an sFRP 

protein or a SOCS3 protein; 
15 (iv) a sequence that encodes a polypeptide comprising the amino acid sequence set 

forth in SEQ ID NO: 72 or 74; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 

(ii) or (iii) or (iv). 

20 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood of survival of a subject suffering from a serous ovarian cancer, 
said method comprising contacting a biological sample from said subject being tested 
with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

25 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a poor probability of survival, and wherein said 
antibody binds to an sFRP polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO: 72 or a SOCS3 polypeptide comprising the amino acid sequence set forth in 

30 SEQ ID NO: 74 or. 

It will also be apparent from the following preferred embodiments, that the expression of 
certain genes listed in Table 1 and Table 3 is statistically correlated with recurrence of 
ovarian cancer, wherein a low P value indicates an enhanced likelihood that a patient 
35 having altered expression of the gene will experience recurrence of the disease. 
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In yet another preferred embodiment, the present invention provides a method of 
determining the likelihood that a subject will suffer from a recurrence of an ovarian 
cancer, said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
5 hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a high probability of recurrence, and wherein said nucleic acid probe 
comprises a sequence selected from the group consisting of: 
10 (i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, 
R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
15 BE564162, BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, 

NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, 
. 20 BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 

AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 
AA188763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 
AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
25 AF262992, AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, 

AF251237, NM_000636, AA130986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, 
AW452948, AW470411, AW885727, AW970859, AW979189, BE165866, 
30 BE175582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 

M83822, N33937, N49068, N51357, N80486, NMJ300954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 
X76732andZ45051, 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
35 to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

and having an Accession Number selected from the group consisting of: M86849, 
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AW963419, BE298665, AK000637, BE077546, T97307, R24601, BE090176, 
AA393907, W28729, BE313754, AW673081, AA356694, L08239, BE397649, 
NM_012317, NM_000947, AJ250562, AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF126245, L10343, NM_002514, 
5 AI863735, NM^005397, W26391, H15474, U51166, AA243499, AW408807, 

AI738719, AB040888, BE313077, AI677897, C14898, AI821730, AF007393, 
H65423, N46243, AA095971, U20350, NM_005756, D19589, AW957446, 
AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, BE464341. 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 

10 AU076611, AW968613, AL353944, BE614149. AA292998, H12912, AA188763, 

AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, AK001564, 
AW959861, BE313555, W25005, AI193356, AF111106, AI130740, AA985190, 
BE221880, AF084545, R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, AF251237, 

15 NM_000636, AA130986, AA216363, AA628980, AA811657, AA897108, 

AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, AI827248, 
AK002039, AL1 09791, AW090198, AW296454, AW445034, AW452948, 
AW470411. AW885727. AW970859, AW979189, BE1 65866, BE1 75582, 
BE242587, BE271927, BE439580, BE464016, D63216, F34856, M83822, 

20 N33937, N49068, N51357, N80486, NM_000954, NM_005756, NM_016652, 

R26584, R31178, W05391, W25005. W45393, W68815, X65965, X76732 and 
Z45051; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting 

of:M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 
BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF126245, L10343, 

30 NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 

AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 

35 AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 

AA1 88763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 
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AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL133572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, M130986. AA216363, AA628980, AA811657, 
5 AA897108, AB040888, AF212225, AI089575, AI282028, AI368826. AI718702. 

AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, 
AW452948, AW470411, AW885727, AW970859, AW979189, BE165866, 
BE1 75582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
10 NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 

X76732 and Z45051; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15 In an alternative preferred embodiment, the present invention provides a method of 
determining the likelihood that a subject will suffer from a recurrence of an ovarian 
cancer, said method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

20 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a high probability of recurrence, and wherein 
said antibody binds to a polypeptide comprising an amino acid sequence comprising at 
least about 10 contiguous amino acid residues of a sequence encoded by a nucleic acid 

25 set forth in Table 1 and having an Accession Number selected from the group consisting 
of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 
BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, NM_002514, 

30 AI863735, NM_005397, W26391, H15474, U51166, AA243499, AW408807, AI738719, 
AB040888, BE313077, AI677897, C14898, AI821730, AF007393, H65423, N46243, 
AA095971, U20350, NM_005756, D19589, AW957446, AW294647, BE159718, 
AI888490, AA022569, BE147740, AI798863, BE464341, AL080235, AI557212, X75208, 
AA628980, BE242587, NM_005512, AW953853, AU076611, AW968613, AL353944, 

35 BE614149, AA292998, H 1291 2, AA1 88763, AK000596, AI970797, AW519204, Z42387, 
AF145713, AA972412, AK001564, AW959861, BE313555, W25005, AI193356, 
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AF111106, AM 30740, AA985190, BE221880, AF084545, R26584, AW247380, 
AA364261, U25849, AF262992, AW342140, AL133572, AI497778, AI745379, U51712, 
AW375974, AF251237, NMJ)00636, AA130986, AA216363, AA628980, AA811657, 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
5 AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, AW452948, 
AW470411, AW885727, AW970859, AW979189, BE1 65866, BE1 75582, BE242587, 
BE271927, BE439580, BE464016, D63216, F34856, M83822, N33937, N49068, 
N51357, N80486, NMJJ00954, NM_005756, NMJ)16652, R26584, R31178, W05391, 
W25005, W45393, W68815, X65965, X76732 and Z45051 . 

10 

The recurrence of ovarian cancer is a clinical recurrence as determined by the presence 
of one or more clinical symptoms of an ovarian cancer, such as, for example, a 
metastases, or alternatively, as determined in a biochemical test, immunological test or 
serological test such as, for example, a cross-reactivity in a biological sample to a CA125 
15 antibody. 

Preferably, the recurrence is capable of being detected at least about 2 years from 
treatment, more preferably about 2-3 years from treatment, and even more preferably 
about 4 or 5 or 1 0 years from treatment. 

20 

Preferably, in the above diagnostic and/or prognostic methods, the biological sample is 
contacted with a plurality of the nucleic acids and/or polypeptides and/or antibodies 
referred to above. In a particularly preferred embodiment, mulpiplex assays are 
performed to detect enhanced expression at least of sFRP4 and SOC3 at the protein 
25 level (eg., using antigen-based or antibody-based assays) or at the mRNA level (eg., by 
detecting elevated levels of mRNA transcripts). 

A further embodiment of the present invention provides a method of diagnosing epithelial 
ovarian cancer by detecting aberrant methylation of a promoter that regulates expression 

30 of a tumor suppressor gene eg., MCC. In particular, the present invention contemplates 
the detection of hypermethylation of the promoter of a tumor suppressor gene. Without 
being bound by any theory or mode of action, such hypermethylation leads to gene . 
inactivation, thereby reducing expression fo the tumor suppressor gene and permitting 
oncogenesis. In one preferred embodiment, the present invention provides a method of 

35 diagnosing an ovarian cancer in a human or animal subject being tested said method 
comprising determining aberrant methylation in a promoter sequence that regulates 
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expression of a tumor suppresspr gene in a biological sample from said subject 
compared to the methylation of the promoter in nucleic acid obtained for a control subject 
not having ovarian cancer wherein said aberrant methylation indicates that the subject 
being tested has an ovarian ovarian cancer. 

5 

In a further aspect, the present invention provides a method for identifying a compound 
that modulates an ovarian cancer-associated polypeptide, the method comprising : 

(i) contacting the compound with a ovarian cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

10 at least 80% identical to a sequence as shown in Tables 1-3; and 

(ii) determining the functional effect of the compound upon the polypeptide. 

The functional effect may, for example, be a physical effect or a chemical effect. In one 
embodiment, the functional effect is determined by measuring iigand binding to the 
15 polypeptide. In a particular embodiment, the polypeptide is expressed in a eukaryotic 
host cell or cell membrane. Preferably the polypeptide is recombinant. 



In another aspect, the present invention provides a method of inhibiting proliferation of a 
ovarian tumour cell, which method comprises contacting said cell with a compound 
20 identified using the method supra for identifying a compound that modulates an ovarian 
cancer-associated polypeptide. 

In a further aspect, the present invention provides a method of inhibiting proliferation of a 
ovarian cancer-associated cell to treat ovarian cancer in a patient, the method 
25 comprising the step of administering to the patient a therapeutically effective amount of a 
compound identified using the method supra for identifying a compound that modulates 
an ovarian cancer-associated polypeptide. 

In a further aspect, the present invention provides a drug screening assay comprising : 
30 (i) administering a test compound to a mammal having ovarian cancer or a cell 

isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-3 in a treated cell or mammal with the level of gene expression of the 
35 polynucleotide in a control cell or mammal, wherein a test compound that 
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modulates the level of expression of the polynucleotide is a candidate for the 
treatment of ovarian cancer. 

Typically, the control is a mammal with ovarian cancer or a cell therefrom that has not 
5 been treated with the test compound. Alternatively, the control is a normal cell or 
mammal. 

The present invention also provides a method for treating a mammal having ovarian 
cancer comprising administering a compound identified the drug screening method 
10 supra. 

In a further aspect, the present invention provides a pharmaceutical composition for use 
in treating a mammal having ovarian cancer, the composition comprising a compound 
identified the screening method supra for identifying a compound that modulates an 
15 ovarian cancer-associated polypeptide, or alternatively, using the drug screening method 
supra, and a physiologically acceptable carrier or diluent. 

In a further aspect, the present invention provides an assay device, preferably for use in 
the diagnosis or prognosis of ovarian cancer, said device comprising a plurality of 
20 polynucleotides immobilized to a solid phase, wherein each of said polnucleotides 
consists of a gene as listed in any one of Tables 1-3. Preferably, the solid phase is a 
substantially planar chip. 

In a related embodiment, the present invention provides an assay device, preferably for 
25 use in the diagnosis or prognosis of ovarian cancer, said device comprising a plurality of 
different antibodies immobilized to a solid phase, wherein each of said antibodies binds 
to a polypeptide listed in Tables 1-3. Preferably, the solid phase is a substantially planar 
chip. 

30 Preferably, the assay device supra is used in a method of diagnosis or prognosis as 
described herein. 

Alternatively, the assay device is used to identify modulatory compounds of the 
expression of one or more genes/proteins listed in any one of Tables 1-3. 

35 



WO 2004/022778 PCT/AU2003/001166 

32 

In a further aspect, the present invention provides a non-human transgenic animal which 
is transgenic by virtue of comprising a gene set forth in any one of Tables 1-3 and, in 
particular, to the use of any such transgenic animal in the performance of a diagnostic or 
prognostic method of the invention as transgenic "knock-out" animals that have disrupted 
5 expression of a gene as set forth in any one of Tables 1 -3. 

In a further aspect, the present invention provides an isolated polynucleotide selected 
from the group consisting of; 

(a) polynucleotides comprising a nucleotide sequence as shown in Tables 1-3, 
10 or the complement thereof; 

(b) polynucleotides comprising a nucleotide sequence capable of selectively 
hybridizing to a nucleotide sequence as shown in Tables 1-3; 

(c) polynucleotides comprising a nucleotide sequence capable of selectively 
hybridizing to the complement of a nucleotide sequence as shown in Tables 1-3; 

15 and 

(d) polynucleotides comprising a polynucleotide sequence which is degenerate 
as a result of the genetic code to the polynucleotides defined in (a), (b) or (c). 

Preferred polynucleotides comprise a polynucleotide sequence as shown in Tables 1-3 
20 or a sequence having at least 80% homology thereto. 

Preferably, the isolated polynucleotide is used for the diagnosis or prognosis of ovarian 
cancer, more preferably by a method as described herein. . In a particularly preferred 
embodiment, the present invention provides for the use of a polynucleotide as set forth in 
25 any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

The present invention also provides a nucleic acid vector comprising a polynucleotide of the 
invention. In one embodiment, the polynucleotide is operably linked to a regulatory 
30 control sequence capable of directing expression of the polynucleotide in a host cell. In a 
particularly preferred embodiment, the present invention provides for the use of a vector 
comprising a polynucleotide as set forth in any one of Tables 1-3 in the diagnosis or 
prognosis of ovarian cancer or for the preparation of a medicament for the treatment of 
ovarian cancer. 



35 
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The present invention further provides a host cell comprising a vector as described in the 
preceding paragraph. In a particularly preferred embodiment, the present invention 
provides for the use of a host cell comprising an introduced polynucleotide as set forth in 
any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
5 preparation of a medicament for the treatment of ovarian cancer. 

In a further aspect, the present invention provides an isolated polypeptide which is encoded 
by a gene set forth in any one of Tables 1-3. The present invention also provides an 
isolated polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 
10 at least 80% identical to a sequence as shown in Tables 1-3. In a particularly preferred 
embodiment, the present invention provides for the use of an isolated polypeptide as set 
forth in any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

In a further aspect the present invention provides an antibody that binds specifically a 
polypeptide listed in Tables 1-3. In a particularly preferred embodiment, the present 
invention provides for the use of an antibody that binds to an isolated polypeptide as set 
forth in any one of Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the 
preparation of a medicament for the treatment of ovarian cancer. 

Brief description of the Drawings 

Figure 1 is a photographic representation showing expression of genes as identified by 
immunohistochemical staining of fixed normal (i.e. non-cancerous or healthy) tissues 
25 (panel A) or ovarian cancer tissue (panel B). The inset in panel A shows inclusion cysts. 
The expression levels of the following genes listed in Table 1 or Table 3 were 
determined: Claudin-3 (SEQ ID NO: 15); EP-CAM (Accession No. NMJ)02354); and 
SOX17 (SEQ ID NO: 17). Positive controls CA125, MUC-1 and E-Cadherin were also 
included for comparison. 



15 
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Figure 2 is a graphical representation showing the correlation between expression of 
different genes in serous ovarian cancer (SOC), mucinous ovarian cancer (MOC), 
endometroid ovarian cancer (EnOC) and clear cell ovarian cancer (CICA). Genes 
5 indicated on the x-axis in each case are as in the legend to Figure 1 . 

Figure 3 is a copy of a photographic representation showing immunohistochemical 
staining of ovary tissue from a normal healthy subject (normal ovary) r a subject 
diagnosed with mucinous ovarian cancer (MOC) and a subject diagnosed with serous 
10 ovarian cancer (SOC), following staining with probes that are specific for L1-Cadherin 
(top row), meprin alpha (middle row). or gaIectin-4 (lower row). Magnification is indicated 
as 20-40X. 

Figure 4a is a copy of a photographic representation showing immunohistochemical 
15 staining of samples from a normal healthy subject (normal) or primary serous ovarian 
tumor (SOC), following staining with probes that are specific for sFRP4 (top row), or 
SOCS3 (lower row). Magnification is indicated as 20X. 

Figure 4b is a copy of a graphical representation showing a Kaplan-Meier survival curve 
20 correlating sFRP4 expression to patient survival over the medium term (i.e., from about 
12 months to about 48 months) to long term (more than about 48 months), indicating that 
high expression of sFRP4 is associated with poor survival in patients (n=127) having 
SOC (p=0.0056). 

25 Detailed description of the preferred embodiments 
Ovarian cancer-associated sequences 

Ovarian cancer-associated sequences can include both nucleic acid (i.e., "ovarian 
cancer-associated genes") and protein (i.e., "ovarian cancer-associated proteins"). 

30 As used herein, the term "ovarian cancer-associated protein" shall be taken to mean any 
protein that has an expression pattern correlated to an ovarian cancer, the recurrence of 
an ovarian cancer or the survival of a subject suffering from ovarian cancer. 



WO 2004/022778 PCT/AU2003/001 166 

35 

Similarly, the term "ovarian cancer-associated gene" shall be taken to mean any nucleic 
acid encoding an ovarian cancer-associated protein or nucleic acid having an expression 
profile that is correlated to an ovarian cancer, the recurrence of an ovarian cancer or the 
survival of a subject suffering from ovarian cancer. 

As will be appreciated by those in the art and is more fully outlined below, ovarian 
cancer-associated genes are useful in a variety of applications, including diagnostic 
applications, which will detect naturally occurring nucleic acids, as well as screening 
applications; e.g., biochips comprising nucleic acid probes or PGR microtitre plates with 
selected probes to the ovarian cancer sequences are generated. 

For identifying ovarian cancer-associated sequences, the ovarian cancer screen typically 
includes comparing genes identified .in different tissues, e.g., normal and cancerous 
tissues, or tumour tissue samples from patients who have metastatic disease vs. non 
metastatic tissue. Other suitable tissue comparisons include comparing ovarian cancer 
samples with metastatic cancer samples from other cancers, such as lung, breast, 
gastrointestinal cancers, ovarian, etc. Samples of different stages of ovarian cancer, e.g., 
survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if 
applicable, and treated as is known in the art for the preparation of mRNA. Suitable 
biochips are commercially available, e.g. from Affymetrix. Gene expression profiles as 
described herein are generated and the data analyzed. 

In one embodiment, the genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably 
normal ovarian, but also including, and not limited to lung, heart, brain, liver, breast, 
kidney, muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a 
preferred embodiment, those genes identified during the ovarian cancer screen that are 
expressed in any significant amount in other tissues are removed from the profile, 
although in some embodiments, this is not necessary. That is, when screening for drugs, 
it is usually preferable that the target be disease specific, to minimise possible side 
effects. 

In a preferred embodiment, ovarian cancer-associated sequences are those that are up- 
regulated in ovarian cancer; that is, the expression of these genes is modifed (up- 
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regulated or down-regulated) in ovarian cancer tissue as compared to non-cancerous 
tissue (see Table 1). 

"Up-regulation" as used herein means at least about a two-fold change, preferably at 
5 least about a three fold change, with at least about five-fold or higher being preferred. All 
Unigene cluster identification numbers and accession numbers herein are for the 
GenBank sequence database and the sequences of the accession numbers are hereby 
expressly incorporated by reference. Sequences are also available in other databases, 
e.g., European Molecular Biology Laboratory (EMBL) and DNA Database of Japan 
10 (DDBJ). 

"Down-regulation" as used herein often means at least about a 1.5-fold change more 
preferably a two-fold change, preferably at least about a three fold change, with at least 
about five-fold or higher being most preferred. 

15 

Particularly preferred sequences are those referred to in Tables 1 or 3 that have a P 
value of less than 0.05, more preferably a P value of less than about 0.01 . 

Similarly, preferred sequences are those referred to in Table 2 as having an absolute 
20 ratio of expression between ovarian patients and normal patients of at least about ±5.0, 
more preferably at least about ±6.0, even more preferrably at leat about ±7.0 or at least 
about ±8.0 or at least about ±9.0 or at least about ±10.0. 

Detection of ovarian cancer sequences for diagnostic/prognostic applications 
25 in one aspect, the RNA expression levels of genes are determined for different cellular 
states in the ovarian cancer phenotype. Expression levels of genes in normal tissue (i.e., 
not undergoing ovarian cancer) and in ovarian cancer tissue (and in some cases, for 
varying severities of ovarian cancer that relate to prognosis, as outlined below) are 
evaluated to provide expression profiles. An expression profile of a particular cell state or 
30 point of development is essentially a "fingerprint" of the state. While two states may have 
any particular gene similarly expressed, the evaluation of a number of genes 
simultaneously allows the generation of a gene expression profile that is reflective of the 
state of the cell. By comparing expression profiles of cells in different states, information 
regarding which genes are important (including both up- and down-regulation of genes) 
35 in each of these states is obtained. Then, diagnosis are performed or confirmed to 
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determine whether a tissue sample has the gene expression profile of normal or 
cancerous tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to qualitative 
or quantitative differences in the temporal and/or cellular gene expression patterns within 
and among cells and tissue. Thus, a differentially expressed gene can qualitatively have 
its expression altered, including an activation or inactivation, in, e.g., normal versus 
ovarian cancer tissue. Genes are turned on or turned off in a particular state, relative to 
another state thus permitting comparison of two or more states. A qualitatively regulated 
gene will exhibit an expression pattern within a state or cell type which is detectable by 
standard techniques. Some genes will be expressed in one state or cell type, but not in 
both. Alternatively, the difference in expression are quantitative, e.g., in that expression is 
increased or decreased; i.e., gene expression is either upregulated, resulting in an 
increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify 
via standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChipTM expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited 
to, quantitative reverse transcriptase PCR, northern analysis and RNase protection. As 
outlined above, preferably the change in expression (i.e., upregulation or downregulation) 
is at least about 50%, more preferably at least about 100%, more preferably at least 
about 150%, more preferably at least about 200%, with from 300 to at least 1000% being 
especially preferred. 

Evaluation are at the gene transcript, or the protein level. The amount of gene expression 
are monitored using nucleic acid probes to the DNA or RNA equivalent of the gene 
transcript, and the quantification of gene expression levels, or, alternatively, the final 
gene product itself (protein) are monitored, e.g., with antibodies to the ovarian cancer- 
associated protein and standard immunoassays (ELISAs, etc.) or other techniques, 
including mass spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins 
corresponding to ovarian cancer genes, i.e., those identified as being important in a 
ovarian cancer phenotype, are evaluated in a ovarian cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed on a plurality of 
genes. Multiple protein expression monitoring are performed as well. Similarly, these 
assays are performed on an individual basis as well. 
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In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of ovarian cancer sequences in a 
particular cell. The assays are further described below in the example. PCR techniques 
5 are used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the ovarian cancer-associated protein 
are detected. Although DNA or RNA 'encoding the ovarian cancer-associated protein are 
detected, of particular interest are methods wherein an mRNA encoding a ovarian 

10 cancer-associated protein . is detected. Probes to detect mRNA are a 
nucleotide/deoxynucleotide probe that is complementary to and hybridizes with the 
mRNA and includes, but is not limited to, oligonucleotides, cDNA or RNA. Probes also 
should contain a detectable label, as defined herein. In one method the mRNA is 
detected after immobilizing the nucleic acid to be examined on a solid support such as 

15 nylon membranes and hybridizing the probe with the sample. Following washing to 
remove the non-specifically bound probe, the label is detected. In another method 
detection of the mRNA is performed in situ. In this method permeabilized cells or tissue 
samples are contacted with a detectably labeled nucleic acid probe for sufficient time to 
allow the probe to hybridize with the target mRNA. Following washing to remove the 

20 non-specifically bound probe, the label is detected. For example a digoxygenin labeled 
riboprobe (RNA probe) that is complementary to the mRNA encoding a ovarian cancer- 
associated protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

25 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in 
diagnostic assays. The ovarian cancer-associated proteins, antibodies, nucleic acids, 
modified proteins and cells containing ovarian cancer sequences are used in diagnostic 
30 assays. This are performed on an individual gene or corresponding polypeptide level. In 
a preferred embodiment, the expression profiles are used, preferably in conjunction with 
high throughput screening techniques to allow monitoring for expression profile genes 
and/or corresponding polypeptides. 

35 As described and defined herein, ovarian cancer-associated proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of ovarian cancer. 
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Detection of these proteins in putative ovarian cancer tissue allows for detection or 
diagnosis of ovarian cancer. In one embodiment, antibodies are used to detect ovarian 
cancer-associated proteins. A preferred method separates proteins from a sample by 
electrophoresis on a gel (typically a denaturing and reducing protein gel, but are another 
5 type of gel, including isoelectric focusing gels and the like). Following separation of 
proteins, the ovarian cancer-associated protein is detected, e.g., by immunoblotting with 
antibodies raised against the ovarian cancer-associated protein. Methods of 
immunoblotting are well known to those of ordinary skill in the art. 

10 In another preferred method, antibodies to the ovarian cancer-associated protein find use 
in in situ imaging techniques, e.g., iri histology (e.g., Methods in Cell Biology: Antibodies 
in Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from 
one to many antibodies to the ovarian cancer-associated protein(s). Following washing to 
remove non-specific antibody binding, the presence of the antibody or antibodies is 

15 detected, in one embodiment the antibody is detected by incubating with a secondary 
antibody that contains a detectable label. In another method the primary antibody to the 
ovarian cancer-associated proteins) contains a detectable label, e.g. an enzyme marker 
that can act on a substrate, in another preferred embodiment each one of multiple 
primary antibodies contains a distinct and detectable label. This method finds particular 

20 use in simultaneous screening for a plurality of ovarian cancer-associated proteins. As 
will be appreciated by one of ordinary skill in the art, many other histological imaging 
techniques are also provided by the invention. 

in a preferred embodiment the label is detected in a fluorometer which has the ability to 
25 detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) are used in the method. In another preferred embodiment, 
antibodies find use in diagnosing ovarian cancer from blood, serum, plasma, stool, and 
other samples. Such samples, therefore, are useful as samples to be probed or tested for 
the presence of ovarian cancer-associated proteins. Antibodies are used to detect a 
30 ovarian cancer-associated protein by previously described immunoassay techniques 
including EL1SA, immunoblotting (western blotting), immunoprecipitation, BIACORE 
technology and the like. Conversely, the presence of antibodies may indicate an immune 
response against an endogenous ovarian cancer-associated protein. 

35 In a preferred embodiment, in situ hybridization of labeled ovarian cancer nucleic acid 
probes to tissue arrays is done. For example, arrays of tissue samples, including ovarian 
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cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on 
the findings. It is further understood that the genes which indicate the diagnosis may 
5 differ from those which indicate the prognosis and molecular profiling of the condition of 
the cells may lead to distinctions between responsive or refractory conditions or are 
predictive of outcomes. 

In a preferred embodiment, the ovarian cancer-associated proteins, antibodies, nucleic 
10 acids, modified proteins and cells containing ovarian cancer sequences are used in 
prognosis assays. As above, gene expression profiles are generated that correlate to 
ovarian cancer, in terms of long term prognosis. Again, this are done on either a protein 
or gene level, with the use of genes being preferred. As above, ovarian cancer probes 
are attached to biochips for the detection and quantification of ovarian cancer sequences 
15 in a tissue or patient. The assays proceed as outlined above for diagnosis. PCR method 
may provide more sensitive and accurate quantification. 

Characteristics of ovarian cancer-associated proteins and genes encoding same 
Ovarian cancer-associated proteins of the present invention are classified as secreted 

20 proteins, transmembrane proteins or intracellular proteins. In one embodiment, the 
ovarian cancer-associated protein is an intracellular protein. Intracellular proteins are 
found in the cytoplasm and/or in the nucleus. Intracellular proteins are involved in all 
aspects of cellular function and replication (including, e.g., signaling pathways); aberrant 
expression of such proteins often results in unregulated or disregulated cellular 

25 processes (see, e.g., Molecular Biology of the Cell (Alberts, ed., 3rd ed., 1994). For 
example, many intracellular proteins have enzymatic activity such as protein kinase 
activity, protein phosphatase activity, protease activity, nucleotide cyclase activity, 
polymerase activity and the like. Intracellular proteins also serve as docking proteins that 
are involved in organizing complexes of proteins, or targeting proteins to various 

30 subcellular localizations, and are involved in maintaining the structural integrity of 
organelles. 

An increasingly appreciated concept in characterising proteins is the presence in the 
proteins of one or more motifs for which defined functions have been attributed. In 
35 addition to the highly conserved sequences found in the enzymatic domain of proteins, 
highly conserved sequences have been identified in proteins that are involved in 
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protein-protein interaction. For example, Src-homology-2 (SH2) domains bind 
tyrosine-phosphorylated targets in a sequence dependent manner. PTB domains, which 
are distinct from SH2 domains, also bind tyrosine phosphorylated targets. SH3 domains 
bind to proline-rich targets. In addition, PH domains, tetratricopeptide repeats and WD 
5 domains to name only a few, have been shown to mediate protein-protein interactions. 
Some of these may also be involved in binding to phospholipids or other second 
messengers. As will be appreciated by one of ordinary skill in the art, these motifs are 
identified on the basis of primary sequence; thus, an analysis of the sequence of proteins 
may provide insight into both the enzymatic potential of the molecule and/or molecules 

10 with which the protein may associate. One useful database is Pfam (protein families), 
which is a large collection of multiple sequence alignments and hidden Markov models 
covering many common protein domains. Versions are available via the internet from 
Washington University in St. Louis, the Sanger Center in England, and the Karolinska 
Institute in Sweden (see, e.g., Bateman et al., 2000, Nuc. Acids Res. 28: 263-266; 

15 Sonnhammer et al., 1997, Proteins 28: 405-420; Bateman et al., 1999, Nuc. Acids Res. 
27:260-262; and Sonnhammer et al., 1998, Nuc. Acids Res. 26: 320-322. 



In another embodiment, the ovarian cancer sequences are transmembrane proteins. 
Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They 

20 may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already 
described for intracellular proteins. For example, the intracellular domain may have 
enzymatic activity and/or may serve as a binding site for additional proteins. Frequently 
the intracellular domain of transmembrane proteins serves both roles. For example 

25 certain receptor tyrosine kinases have both protein kinase activity and SH2 domains. In 
addition, autophosphorylation of tyrosines on the receptor molecule itself, creates binding 
sites for additional SH2 domain containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. For 
30 example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl 
cyclases and receptor serine/threonine protein kinases contain a single transmembrane 
domain. However, various other proteins including channels and adenylyl cyclases 
contain numerous transmembrane domains. Many important cell surface receptors such 
as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
35 domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 
transmembrane domains include approximately 20 consecutive hydrophobic amino acids 



WO 2004/022778 



PCT/AU2003/001166 



that are followed by charged amino acids. Therefore, upon analysis of the amino acid 
sequence of a particular protein, the localization and number of transmembrane domains 
within the protein are predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 
Important transmembrane protein receptors include, but are not limited to the insulin 
5 receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose 
transporters, transferrin receptor, epidermal growth factor receptor, low density 
lipoprotein receptor, epidermal growth factor receptor, leptin receptor, interleukin 
receptors, e.g. IL-1 receptor, IL-2 receptor, 

10 The extracellular domains of transmembrane proteins are diverse; however, conserved 
motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or functions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains 
are found on receptors. Factors that bind the receptor domain include circulating ligands, 

15 which are peptides, proteins, or small molecules such as adenosine and the like. For 
example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors 
include cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular 
domains also bind to cell-associated molecules. In this respect, they mediate cell-cell 

20 interactions., Cell-associated ligands are tethered to the cell, e.g., via a 
glycosylphosphatidylinositol (GPI) anchor, or may themselves be. transmembrane 
proteins. Extracellular domains also associate with the extracellular matrix and contribute 
to the maintenance of the cell structure. 

25 Ovarian cancer-associated proteins that are transmembrane are particularly preferred in 
the present invention as they are readily accessible targets for immunotherapeutics, as 
are described herein. In addition, as outlined below, transmembrane proteins are also 
useful in imaging modalities. Antibodies are used to label such readily accessible 
proteins in situ. Alternatively, antibodies can also label intracellular proteins, in which 

30 case samples are typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein are made 
soluble by removing transmembrane sequences, e.g., through recombinant methods. 
Furthermore, transmembrane proteins that have been made soluble are made to be 
35 secreted through recombinant means by adding an appropriate signal sequence. 
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In another embodiment, the ovarian cancer-associated proteins are secreted proteins; 
the secretion of which are either constitutive or regulated. These proteins have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 
proteins are involved in numerous . physiological events; by virtue of their circulating 
5 nature, they serve to transmit signals to various other cell types. The secreted protein 
may function in an autocrine manner (acting on the cell that secreted the factor), a 
paracrine manner (acting on cells in close proximity to the cell that secreted the factor) or 
an endocrine manner (acting on cells at a distance). Thus secreted molecules find use in 
modulating or altering numerous aspects of physiology. Ovarian cancer-associated 
10 proteins that are secreted proteins are particularly preferred in the present invention as 
they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, or 
stool tests. 

Mammalian subjects 

15 The present invention provides nucleic acid and protein sequences that are differentially 
expressed in ovarian cancer, herein termed "ovarian cancer sequences." As outlined 
below, ovarian cancer sequences include those that are up-regulated (i.e., expressed at 
a higher level) in ovarian cancer, as well as those that are down-regulated (i.e., 
expressed at a lower level). In a preferred embodiment, the ovarian cancer sequences 

20 are from humans; however, as will be appreciated by those in the art, ovarian cancer 
sequences from other organisms are useful in animal models of disease and drug 
evaluation; thus, other ovarian cancer sequences are provided, from vertebrates, 
including mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, 
farm animals (including sheep, goats, pigs, cows, horses, etc.) and pets, e.g., (dogs, 

25 cats, etc.). 

Assay control samples 

It will be apparent from the preceding discussion that many of the diagnostic methods 
provided by the present invention involve a degree of quantification to determine, on the 

30 one hand, the over-expression or reduced-expression of a diagnostic/prognostic marker 
in tissue that is suspected of comprising a cancer cell. Such quantification can be readily 
provided by the inclusion of appropriate control samples in the assays described below, 
derived from healthy or normal individuals. Alternatively, if internal controls are not 
included in each assay conducted, the control may be derived from an established data 

35 set that has been generated from healthy or normal individuals. 
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In the present context, the term "healthy individual" shall be taken to mean an individual 
who is known not to suffer from ovarian cancer, such knowledge being derived from 
clinical data on the individual, including, but not limited to, a different cancer assay to that 
described herein. As the present invention is particularly useful for the early detection of 
5 ovarian cancer, it is preferred that the healthy individual is asymptomatic with respect to 
the early symptoms associated with ovarian cancer. Although early detection using well- 
known procedures is difficult, reduced urinary frequency, rectal pressure, and abdominal 
bloating and swelling, are associated with the disease in its early stages, and, as a 
consequence, healthy individuals should not have any of these clinical symptoms. 
10 Clearly, subjects suffering from later symptoms associated with ovarian cancer, such as, 
for example, metastases in the omentum, abdominal fluid, lymph nodes, lung, liver, brain, 
or bone, and subjects suffering from spinal cord compression, elevated calcium level, 
chronic pain, or pleural effusion, should also be avoided from the "healthy individual" data 
set. 

15 

The term "normal individual" shall be taken to mean an individual having a normal level of 
expression of a cancer-associate gene or cancer-associated protein in a particular 
sample derived from said individual. As will be known to those skilled in the art, data 
obtained from a sufficiently large sample of the population will normalize, allowing the 

20 generation of a data set for determining the average level , of a particular parameter. 
Accordingly, the level of expression of a cancer-associate gene or cancer-associated 
protein can be determined for any population of individuals, and for any sample derived 
from said individual, for subsequent comparison to levels determined for a sample being 
assayed. Where such normalized data sets are relied upon, internal controls are 

25 preferably included in each assay conducted to control for variation. 

In one embodiment, the present invention provides a method for detecting a cancer cell 
in a subject, said method comprising: 

(i) determining the level of mRNA encoding a cancer-associated protein expressed 
30 in a test sample from said subject; and 

(ii) comparing the level of mRNA determined at (i) to the level of mRNA encoding a 
cancer-associated protein expressed in a comparable sample from a healthy or 
normal individual, 

wherein a level of mRNA at (i) that is modified in the test sample relative to the 
35 comparable sample from the normal or healthy individual is indicative of the presence of 
a cancer cell in said subject. 
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Alternatively, or in addition, the control! may comprise a cancer-associated sequence that 
is known to be expressed at a particular level in an ovarian cancer, eg., CA125, MUC-1 
or E-Cadherin, amongast others. 

5 

Biological samples 

Preferred biological samples in which the assays of the invention are performed include 
bodily fluids, ovarian tissue and cells, and those tissues known to comprise cancer cells 
arising from a metastasis of an ovarian cancer, such as, for example, in carcinomas of 
10 the lung, prostate, breast, colon, pancreas, placenta, or omentum , and in cells of brain 
anaplastic oligodendrogliomas. 

Bodily fluids shall be taken to include whole blood, serum, peripheral blood mononuclear 
cells (PBMC), or buffy coat fraction. 

15 

In the present context, the term "cancer cell" includes any biological specimen or sample 
comprising a cancer cell irrespective of its degree of isolation or purity, such as, for 
example, tissues, organs, cell lines, bodily fluids, or histology specimens that comprise a 
cell in the early stages of transformation or having been transformed. 

20 

As the present invention is particularly useful for the early detection and prognosis of 
cancer ofe rthe medium to long term, the definition of "cancer cell" is not to be limited by 
the stage of a cancer in the subject from which said cancer cell is derived (ie. whether or 
not the patient is in remission or undergoing disease recurrence or whether or not the 
25 cancer is a primary tumor or the consequence of metastases). Nor is the term "cancer 
cell" to be limited by the stage of the cell cycle of said cancer cell. 

Preferably, the sample comprises ovarian tissue, prostate tissue, kidney tissue, uterine 
tissue, placenta, a cervical specimen, omentum, rectal tissue, brain tissue, bone tissue, 

30 lung tissue, lymphatic tissue, urine, semen, blood, abdominal fluid, or serum, or a cell 
preparation or nucleic acid preparation derived therefrom. More preferably, the sample 
comprises serum or abdominal fluid, or a tissue selected from the group consisting of: 
ovary, lymph, lung, liver, brain, placenta, brain, omentum, and prostate. Even more 
preferably, the sample comprises serum or abdominal fluid, ovary (eg. OSE), or lymph 

35 node tissue. The sample can be prepared on a solid .matrix for histological analyses, or 
alternatively, in a suitable solution such as, for example, an extraction buffer or 
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suspension buffer, and the present invention clearly extends to the testing of biological 
solutions thus prepared. 



Polynucleotide probes and amplification primers 
5 Polynucleotide probes are derived from or comprise the nucleic acid sequences whose 
nucleotide sequences are provided by reference to the public database accession 
numbers given in Tables 1-3 (referred to herein as the nucleotide sequences shown in 
Tables 1-3), and sequences homologues thereto as well as variants, derivatives and 
fragments thereof . 

10 

Whilst the probes may comprise double-stranded or single-stranded nucleic acid, single- 
stranded probes are preferred because they do not require melting prior to use in 
hybridizations. On the other hand, longer probes are also preferred because they can be 
used at higher hybridization stringency than shorter probes and may produce lower 
15 background hybridization than shorter probes. 

So far as shorter probes are concerned, single-stranded, chemically-synthesized 
oligonucleotide probes are particularly preferred by the present invention. To reduce the 
noise associated with the use of such probes during hybridization, the nucleotide 

20 sequence of the probe is carefully selected to maximize the Tm at which hybridizations 
can be performed, reduce non-specific hybridization, and to reduce self-hybridization. 
Such considerations may be particularly important for applications involving high 
throughput screening using microarray technology. In general, this means that the 
nucleotide sequence of an oligonucleotide probe is selected such that it is unique to the 

25 target RNA or protein-encoding sequence, has a low propensity to form secondary 
structure, low self-complementary, and is not highly A/T-rich. 

The only requirement for the probes is that they cross-hybridize to nucleic acid encoding 
the target diagnostic protein or the complementary nucleotide sequence thereto and are 

30 sufficiently unique in sequence to generate high signalrnoise ratios under specified 
hybridization conditions. As will be known to those skilled in the art, long nucleic acid 
probes are preferred because they tend to generate higher signalrnoise ratios than 
shorter probes and/or the duplexes formed between longer molecules have higher 
melting temperatures (i.e. Tm values) than duplexes involving short probes. Accordingly, 

35 full-length DNA or RNA probes are contemplated by the present invention, as are specific 
probes comprising the sequence of the 3'-untranslated region or complementary thereto. 
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In a particularly preferred embodiment, the nucleotide sequence of an oligonucleotide 
probe has no detectable nucleotide sequence identity to a nucleotide sequence in a 
BLAST search (Altschul et a/., J. Mol. BioL 215, 403-410, 1990) or other database 
5 search, other than a sequence selected from the group consisting of: (a) a sequence 
encoding a polypeptide listed in any one of Tables 1-3; (b) the 5-u translated region of a 
sequence encoding a polypeptide listed in any one of Tables 1-3; (c) a 3'-untranslated 
region of a sequence encoding a polypeptide listed in any one of Tables 1-3; and (d) an 
exon region of a sequence encoding a polypeptide listed in any one of Tables 1-3. 

10 

Additionally, the self-complementarity of a nucleotide sequence can be determined by 
aligning the sequence with its reverse complement, wherein detectable regions of identity 
are indicative of potential self-complementarity. As will be known to those skilled in the 
art, , such sequences may not necessarily form secondary structures during hybridization 

15 reaction, and, as a consequence, successfully identify a target nucleotide sequence. It is 
also known to those skilled in the art that, even where a sequence does form secondary 
structures during hybridization reactions, reaction conditions can be modified to reduce 
the adverse consequences of such structure formation. Accordingly, a potential for self- 
complementarity should not necessarily exclude a particular candidate oligonucleotide 

20 from selection. In cases where it is difficult to determine nucleotide sequences having no 
potential self-complementarity, the uniqueness of the sequence should outweigh a 
consideration of its potential for secondary structure formation. 

Recommended pre-requisites for selecting oligonucleotide probes, particularly with 
25 respect to probes suitable for microarray technology, are described in detail by Lockhart 
et a/., "Expression monitoring by hybridization to high-density oligonucleotide arrays", 
Nature Biotech. 14, 1675-1680, 1996. 

The nucleic acid probe may comprise a nucleotide sequence that is within the coding 
30 strand of a gene listed in any one of Tables 1-3. Such "sense" probes are useful for 
detecting RNA by amplification procedures, such as, for example, polymerase chain 
reaction (PCR), and more preferably, quantitative PCR or reverse transcription 
polymerase chain reaction (RT-PCR). Alternatively, "sense" probes may be expressed to 
produce polypeptides or immunologically active derivatives thereof that are useful for 
35 detecting the expressed protein in samples. 
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The nucleotide sequences referred to in Tables 1-3 and homologues thereof, typically 
encode polypeptides. It will be understood by a skilled person that numerous different 
polynucleotides can encode the same polypeptide as a result of the degeneracy of the 
5 genetic code. In addition, it is to be understood that skilled persons may, using routine 
techniques, make nucleotide substitutions that do not affect the polypeptide sequence 
encoded by the polynucleotides of the invention to reflect the codon usage of any particular 
host organism in which the polypeptides of the invention are to be expressed. 

10 Polynucleotides may comprise DNA or RNA. They are single-stranded or double- 
stranded. They may also be polynucleotides which include within them synthetic or 
modified nucleotides. A number of different types of modification to oligonucleotides are 
known in the art. These include methylphosphonate and phosphorothioate backbones, 
addition of acridine or polylysine chains at the 3' and/or 5' ends of the molecule. For the 

15 purposes of the present invention, it is to be understood that the polynucleotides 
described herein are modified by any method available in the art. Such modifications are 
carried out in order to enhance the in vivo activity or life span of the diagnostic/prognostic 
polynucleotides. 

20 The terms "variant" or "derivative" in relation to the nucleotide sequences of the present 
invention include any substitution of, variation of, modification of, replacement of, deletion of 
or addition of one (or more) nucleic acid from or to the sequence provided that the resultant 
nucleotide sequence codes for a polypeptide having biological activity, preferably having 
substantially the same activity as the polypeptide sequences presented in the sequence 

25 listings. 

With respect to sequence homology, preferably there is at least 75%, more preferably at 
least 85%, more preferably at least 90% homology to a sequence shown in Tables 1-3 
herein over a region of at least 20, preferably at least 25 or 30, for instance at least 40, 60, 
30 100, 500, 1000 or more contiguous nucleotides. More preferably there is at least 95%, 
more preferably at least 98%, homology. In one embodiment, homologues are naturally 
occurring sequences, such as orthologues, tissue-specific isoforms and allelic variants. 

Homology comparisons are conducted by eye, or more usually, with the aid of readily 
35 available sequence comparison programs. These commercially available computer 
programs can calculate % homology between two or more sequences. 
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Percentage (%) homology are calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each nucleotide in one sequence directly compared 
with the corresponding nucleotide in the other sequence, one base at a time. This is called 
5 an "ungapped" alignment. Typically, such ungapped alignments are performed only over a 
relatively short number of bases (for example less than 50 contiguous nucleotides). 

Although this is a very simple and consistent method, it fails to take into consideration that, 
for example, in an otherwise identical pair of sequences, one insertion or deletion will cause 

10 the following nucleotides to be put out of alignment, thus potentially resulting in a large 
reduction in % homology when a global alignment is performed. Consequently, most 
sequence comparison methods are designed to produce optimal alignments that take into 
consideration possible insertions and deletions without penalising unduly the overall 
homology score. This is achieved by inserting "gaps" in the sequence alignment to try to 

15 maximise local homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs in 
the alignment so that, for the same number of identical amino acids, a sequence alignment 
with as few gaps as possible - reflecting higher relatedness between the two compared 

20 sequences - will achieve a higher score than one with many gaps. "Affine gap costs" are 
typically used that charge a relatively high cost for the existence of a gap and a smaller 
penalty for each subsequent residue in the gap. This is the most commonly used gap 
scoring system. High gap penalties will of course produce optimised alignments with fewer 
gaps. Most alignment programs allow the gap penalties to be modified. However, it is 

25 preferred to use the default values when using such software for sequence comparisons. 

In determining whether or not two amino acid sequences fall within the stated defined 
percentage identity limits, those skilled in the art will be aware that it is necessary to 
conduct a side-by-side comparison of amino acid sequences. In such comparisons or 

30 alignments, differences will arise in the positioning of non-identical amino acid residues 
depending upon the algorithm used to perform the alignment. In the present context, 
references to percentage identities and similarities between two or more amino acid 
sequences shall be taken to refer to the number of identical and similar residues 
respectively, between said sequences as determined using any standard algorithm 

35 known to those skilled in the art. In particular, amino acid identities and similarities are 
calculated using the GAP program of the Computer Genetics Group, Inc., University 
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Research Park, Madison, Wisconsin, United States of America (Devereaux et a/, Nucl. 
Acids Res. 12, 387-395,1984), which utilizes the algorithm of Needleman and Wunsch J. 
Mol. Biol. 48, 443-453, 1970, or alternatively, the CLUSTAL W algorithm of Thompson et 
a/., Nucl. Acids Res. 22, 4673-4680, 1994, for multiple alignments, to maximize the 
5 number of identical/similar amino acids and to minimize the number and/or length of 
sequence gaps in the alignment 

A suitable computer program for carrying out such an alignment is the GCG Wisconsin 
Bestfit package (University of Wisconsin, U.SA; Devereux et al., 1984, Nucleic Acids 
10 Research 12:387). The default scoring matrix has a match value of 10 for each identical 
nucleotide and -9 for each mismatch. The default gap creation penalty is -50 and the default 
gap extension penalty is -3 for each nucleotide. 

Examples of other software than can perform sequence comparisons include, but are not 
15 limited to, the BLAST package (see Ausubel et a/., 1999 ibid - Chapter 18), FASTA 
(Atschul et a!., 1990, J. Mol. Biol., 403-410) and the GENEWORKS suite of comparison 
tools. Both BLAST and FASTA are available for offline and online searching (see 
Ausubel ef a/., 1999 ibid, pages 7-58 to 7-60). However it is preferred to use the GCG 
Bestfit program. 

20 

Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of the 
sequence comparison and generates a numerical result. 

25 A preferred sequence comparison program is the GCG Wisconsin Bestfit program 
described above. 

The present invention also encompasses the use of nucleotide sequences that are capable 
of hybridizing selectively to the sequences presented herein, or any variant, fragment or 
30 derivative thereof, or to the complement of any of the above. Nucleotide sequences are 
preferably at least 15 nucleotides in length, more preferably at least 20, 30, 40 or 50 
nucleotides in length. 

The term "hybridization" as used herein shall include "the process by which a strand of 
35 nucleic acid joins with a complementary strand through base pairing" as well as the 
process of amplification as carried out in polymerase chain reaction technologies. 
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Polynucleotides capable of selectively hybridizing to the nucleotide sequences presented 
herein, or to their complement, will be generally at least 70%, preferably at least 80 or 90% 
and more preferably at least 95% or 98% homologous to the corresponding nucleotide 
5 sequences referred to in Tables 1-3 over a region of at least 20, preferably at least 25 or 30, 
for instance at least 40, 60, 100, 500, 1000 or more contiguous nucleotides. 

. The term "selectively hybridizable" means that the polynucleotide used as a probe is used 
under conditions where a target polynucleotide is found to hybridize to the probe at a level 

10 significantly above background. The background hybridization may occur because of other 
polynucleotides present for example, in the cDNA or genomic DNA library being screening. 
In this event, background implies a level of signal generated by interaction between the 
probe and a non-specific DNA member of the library which is less than 10 fold, preferably 
less than 100 fold as intense as the specific interaction observed with the target DNA. The 

15 intensity of interaction are measured, for example, by radiolabelling the probe, e.g. with ^P. 

Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid 
binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego CA), and 
20 confer a defined "stringency" as explained below. 

For the purposes of defining the level of stringency, a high stringency hybridization is 
achieved using a hybridization buffer and/or a wash solution comprising the following: 

(i) a salt concentration that is equivalent to 0.1xSSC-0.2xSSC buffer or lower salt 
25 concentration; 

(ii) a detergent concentration equivalent to 0.1 % (w/v) SDS or higher; and 

(iii) an incubation temperature of 55°C or higher. 

Conditions for specifically hybridizing nucleic acid, and conditions for washing to remove 
30 non-specific hybridizing nucleic acid, are well understood by those skilled in the art. For 
the purposes of further clarification only, reference to the parameters affecting 
hybridization between nucleic acid molecules is found in Ausubel et al. (Current 
Protocols in Molecular Biology, Wiley Interscience, ISBN 047150338, 1992), which is 
herein incorporated by reference. 

35 
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Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of the probe); 
high stringency at about 5°C to 10°C below Tm; intermediate stringency at about 10°C to 
20°C below Tm; and low stringency at about 20°C to 25°C below Tm. As will be 
understood by those of skill in the art, a maximum stringency hybridization are used to 
5 identify or detect identical polynucleotide sequences while an intermediate (or low) 
stringency hybridization are used to identify or detect similar or related polynucleotide 
sequences. 

In a preferred aspect, the present invention covers nucleotide sequences that can hybridize 
10 to the nucleotide sequence of the present invention under stringent conditions (e.g. 65°C 
and O.lxSSC {1xSSC = 0.15 M Nad, 0.015 M Na 3 Citrate pH 7.0}). 

Where the diagnostic/prognostic polynucleotide is double-stranded, both strands of the 
duplex, either individually or in combination, are encompassed by the present invention. 
15 Where the polynucleotide is single-stranded, it is to be understood that the complementary 
sequence of that polynucleotide is also included within the scope of the present invention. 

Polynucleotides which are not 100% homologous to the sequences of the present invention 
but are useful in perfoming the diagnostic and/or prognostic assays of the invetnion by 

20 virtue of their ability to selectively hybridize to the target gene transcript, or to encode an 
immunologically cross-reactive protein to the target protein, are obtained in a number of 
ways, such as, for example by probing DNA libraries made from a range of individuals, for 
example individuals from different populations. In particular, given that that changes in the 
expression of diagnostic/prognostic cancer-associated genes correlate with ovarian cancer, 

25 characterisation of variant sequences in individuals suffering from ovarian cancer is used to 
identify variations in the sequences of ovarian-cancer associated genes (and proteins) that 
are predictive of and/or causative of ovarian cancer. 

Accordingly the present invention provides methods of identifying sequence variants that 
30 are associated with ovarian cancer which methods comprise determining all or part of the 
nucleotide sequence of a gene referred to in Tables 1-3, derived from an individual suffering 
from ovarian cancer and comparing the sequence to that of the corresponding wild-type 
gene. 



35 



In addition, other viral/bacterial, or cellular homologues particularly cellular homologues 
found in mammalian cells (e.g. rat, mouse, bovine and primate cells), are obtained and such 
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homologues and fragments thereof in general will be capable of selectively hybridizing to 
the sequences shown in the sequence listing herein. Such sequences are obtained by 
probing cDNA libraries made from or genomic DNA libraries from other animal species, and 
probing such libraries with probes comprising all or part of the sequences referred to in 
5 Tables 1-3 under conditions of medium to high stringency. Similar considerations apply to 
obtaining species homologues and allelic variants of the nucleotide sequences referred to in 
Tables 1-3. 

Variants and strain/species homologues may also be obtained using degenerate PCR 
10. which will use primers designed to target sequences within the variants and homologues 
encoding conserved amino acid sequences within the sequences of the present invention. 
Conserved sequences are predicted, for example, by aligning the amino acid sequences 
from several variants/homologues. Sequence alignments are performed using computer 
software known in the art. For example the GCG Wisconsin PileUp program is widely used. 

15 

The primers used in degenerate PCR will contain one or more degenerate positions and will 
be used at stringency conditions lower than those used for cloning sequences with single 
sequence primers against known sequences. 

20 Alternatively, such polynucleotides are obtained by site-directed mutagenesis of 
characterised sequences, such as the sequences referred to in Tables 1-3. This are useful 
where for example silent codon changes are required to sequences to optimise codon 
preferences for a particular host cell in which the polynucleotide sequences are being 
expressed. Other sequence changes are desired in order to introduce restriction enzyme 

25 recognition sites, or to alter the property or function of the polypeptides encoded by the 
polynucleotides. 

Polynucleotides comprising a diagnostic/prognostic cancer-associated gene are used to 
produce a primer by standard derivatization means, e.g. a PCR primer, a primer for an 
30 alternative amplification reaction, a probe e.g. labelled with a detectable label by 
conventional means using radioactive or non-radioactive labels, or the polynucleotides are 
cloned into vectors. Such primers, probes and other fragments will be at least 15, preferably 
at least 20, for example at least 25, 30 or 40 nucleotides in length. Preferred fragments are 
less than 5000, 2000, 1000, 500 or 200 nucleotides in length. 



35 
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Polynucleotides such as a DNA polynucleotides and probes according to the invention are 
produced by recombinant or synthetic means, including cloning by standard techniques. 

In general, primers will be produced by synthetic means, involving a step wise manufacture 
5 of the desired nucleic acid sequence one nucleotide at a time. Techniques for 
accomplishing this using automated techniques are readily available in the. art. 

Longer polynucleotides will generally be produced using recombinant means, for example 
using PCR (polymerase chain reaction) cloning techniques. This will involve making a pair 

10 of primers (e.g. of about 15 to 30 nucleotides) flanking a region of the sequence which it is 
desired to clone, bringing the primers into contact with mRIMA or cDNA obtained from an 
3nimal or human cell, performing a polymerase chain reaction under conditions which bring 
about amplification of the desired region, isolating the amplified fragment (e.g. by purifying 
the reaction mixture on an agarose gel) and recovering the amplified DNA. The primers are 

15 designed to contain suitable restriction enzyme recognition sites so that the amplified DNA 
are cloned into a suitable cloning vector 

Polynucleotide probes or primers preferably carry a detectable label. Suitable labels 
include radioisotopes such as 32 P or 35 S f enzyme labels, or other protein labels such as 
20 biotin. Such labels are added to polynucleotides or primers and are detected using by 
techniques known in the art. 

Polynucleotide probes or primers labeled or unlabeled are also used by a person skilled 
in the art in nucleic acid-based tests for detecting or sequencing diagnostic/prognostic 
25 cancer-associated gene. 

Such tests for detecting generally comprise bringing a biological sample containing DNA 
or RNA into contact with a probe comprising a polynucleotide probe or primer under at 
least low stringency hybridization conditions and detecting any duplex formed between 

30 the probe/primer and nucleic acid in the sample. Such detection are achieved using 
techniques such as PCR or by immobilising the probe on a solid support, removing 
nucleic acid in the sample which is not hybridized to the probe, and then detecting 
nucleic acid which has hybridized to the probe. Alternatively, the sample nucleic acid are 
immobilised on a solid support, and the amount of probe bound to such a support are 

35 detected. Suitable assay methods of this and other formats are found in for example 
W089/03891 and W090/13667. 
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Tests for sequencing nucleotides include bringing a biological sample containing target 
DNA or RNA into contact with a probe comprising a polynucleotide probe or primer under 
at least low stringency hybridization conditions and determining the sequence by, for 
example the Sanger dideoxy chain termination method (see Sambrook et al.). 

Such a method generally comprises elongating, in the presence of suitable reagents, the 
primer by synthesis of a strand complementary to the target DNA or RNA and selectively 
terminating the elongation reaction at one or more of an A, C, G or T/U residue; allowing 
strand elongation and termination reaction to occur; separating out according to size the 
elongated products to determine the sequence of the nucleotides at which selective 
termination has occurred. Suitable reagents include a DNA polymerase enzyme, the 
deoxynucleotides dATP, dCTP, dGTP and dTTP, a buffer and ATP. Dideoxynudeotides 
are used for selective termination. 

Tests for detecting or sequencing nucleotides in a biological sample are used as part of 
the methods of the invention for detecting ovarian cancer-associated transcripts and 
monitoring the efficacy of treatment of patients suffering from ovarian cancer as 
described in more detail herein. 

The probes/primers may conveniently be packaged in the form of a test kit in a suitable 
container. In such kits the probe are bound to a solid support where the assay format for 
which the kit is designed requires such binding. The kit may also contain suitable 
reagents for treating the sample to be probed, hybridizing the probe to nucleic acid in the 
sample, control reagents, instructions, and the like. 

Preferably, a kit of the invention comprises primers/probes suitable for selectively 
detecting a plurality of sequences, more preferably for selectively detecting a plurality of 
sequences that are listed in one or more of Tables 1-3 as having a P value of less than 
0.05, more preferably a P value of less than 0.01. Similarly, a kit of the invention 
preferably comprises primers suitable for selectively detecting a plurality of sequences 
referred to in Table 1 or 2 or 3. 

Nucleic acid-based assay formats 

As discussed in detail below, the status of expression of a cancer-associated gene in 
patient samples may be analyzed by a variety protocols that are well known in the art 
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including in situ hybridization, northern blotting techniques, RT-PCR analysis (such as, 
for example, performed on laser capture microdissected samples), and microarray 
technology, such as, for. example, using tissue microarrays probed with nucleic acid 
probes, or nucleic acid microarrays (ie. RNA microarrays or amplified DNA microarrays) 
5 microarrays probed with nucleic acid probes. All such assay formats are encompassed 
by the present invention. 

For high throughput screening of large numbers of samples, such as, for example, public 
health screening of subjects, particularly human subjects, having a higher risk of 
10 developing cancer, microarray technology is a preferred assay format. 

In accordance with such high throughput formats, techniques for producing immobilised 
arrays of DNA molecules have been described in the art. Generally, most prior art 
methods describe how to synthesise single-stranded nucleic acid molecule arrays, using 

15 for example masking techniques to build up various permutations of sequences at the 
various discrete positions on the solid substrate. U.S. Patent No. 5,837,832, the contents 
of which are incorporated herein by reference, describes an improved method for 
producing DNA arrays immobilised to silicon substrates based on very large scale 
integration technology. In particular, U.S. Patent No. 5,837,832 describes a strategy called 

20 "tiling" to synthesize specific sets of probes at spatially-defined locations on a substrate 
which are used to produced the immobilised DNA arrays. U.S. Patent No. 5,837,832 also 
provides references for earlier techniques that may also be used. 

Thus DNA are synthesised in situ on the surface of the substrate. However, DNA may 
25 also be printed directly onto the substrate using for example robotic devices equipped 
with either pins or piezo electric devices. 

The plurality of polynucleotide sequences are typically immobilised onto or in discrete 
regions of a solid substrate. The substrate are porous to allow immobilisation within the 

30 substrate or substantially non-porous, in which case the library sequences are typically 
immobilised on the surface of the substrate. The solid substrate are made of any material 
to which polypeptides can bind, either directly or indirectly. Examples of suitable solid 
substrates include flat glass, silicon wafers, mica, ceramics and organic polymers such 
as plastics, including polystyrene and polymethacrylate. It may also be possible to use 

35 semi-permeable membranes such as nitrocellulose or nylon membranes, which are 
widely available. The semi-permeable membranes are mounted on a more robust solid 
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surface such as glass. The surfaces may optionally be coated with a layer of metal, such 
as gold, platinum or other transition metal. A particular example of a suitable solid 
substrate is the commercially available BIACore™ chip (Pharmacia Biosensors). 

5 Preferably, the solid substrate is generally a material having a rigid or semi-rigid surface. 
In preferred embodiments, at least one surface of the substrate will be substantially flat, 
although in some embodiments it are desirable to physically separate synthesis regions 
for different polymers with, for example, raised regions or etched trenches. It is also 
preferred that the solid substrate is suitable for the high density application of DNA 
10 sequences in discrete areas of typically from 50 to 100 pm, giving a density of 10000 to 
40000 cm* 2 . 

The solid substrate is conveniently divided up into sections. This are achieved by 
techniques such as photoetching, or by the application of hydrophobic inks, for example 
15 teflon-based inks (CeWine, USA). 

Discrete positions, in which each different member of the array is located may have any 
convenient shape, e.g., circular, rectangular, elliptical, wedge-shaped, etc. 

20 Attachment of the polynucleotide sequences to the substrate are by covalent or non- 
covalent means. The plurality of polynucleotide sequences are attached to the substrate 
via a layer of molecules to which the sequences bind. For example, the sequences are 
labelled with biotin and the substrate coated with avidin and/or streptavidin. A 
convenient feature of using biotinylated sequences is that the efficiency of coupling to the 

25 solid substrate are determined easily. Since the library sequences may bind only poorly 
to some solid substrates, it is often necessary to provide a chemical interface between 
the solid substrate (such as in the case of glass) and the sequences. Examples of 
suitable chemical interfaces include hexaethylene glycol. Another example is the use of 
polylysine coated glass, the polylysine then being chemically modified using standard 

30 procedures to introduce an affinity ligand. Other methods for attaching molecules to the 
surfaces of solid substrate by the use of coupling agents are known in the art, see for 
example W098/49557. 

The complete DNA array is typically read at the same time by charged coupled device 
35 (CCD) camera or confocal imaging system. Alternatively, the DNA array are placed for 
detection in a suitable apparatus that can move in an x-y direction, such as a plate 
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reader. In this way, the change in characteristics for each discrete position are 
measured automatically by computer controlled movement of the array to place each 
discrete element in turn in line with the detection means. 

The detection means are capable of interrogating each position in the library array 
optically or electrically. Examples of suitable detection means include CCD cameras or 
confocal imaging systems. 

In a preferred embodiment, the level of expression of the cancer-associated gene in the 
test sample is determined by hybridizing a probe/primer to RNA in the test sample under 
at least low stringency hybridization conditions and detecting the hybridization using a 
detection means. 

Similarly, the level of mRNA in the comparable sample from the healthy or normal 
individual is preferably determined by hybridizing a probe/primer to RNA in said 
comparable sample under at least low stringency hybridization conditions and detecting 
the hybridization using a detection means. 

For the purposes of defining the level of stringency to be used in these diagnostic 
assays, a low stringency is defined herein as being a hybridization and/or a wash carried 
out in 6xSSC buffer, 0.1% (w/v) SDS at 28°C, or equivalent conditions. A moderate 
stringency is defined herein as being a hybridization and/or washing carried out in 2xSSC 
buffer, 0.1% (w/v) SDS at a temperiature in the range 45°C to 65°C, or equivalent 
conditions. A high stringency is defined herein as being a hybridization and/or wash 
carried out in O.lxSSC buffer, 0.1% (w/v) SDS, or lower salt concentration, and at a 
temperature of at least 65°C, or equivalent conditions. Reference herein to a particular 
level of stringency encompasses equivalent conditions using wash/hybridization solutions 
other than SSC known to those skilled in the art. 

Generally, the stringency is increased by reducing the concentration of SSC buffer, 
and/or increasing the concentration of SDS and/or increasing the temperature of the 
hybridization and/or wash. Those skilled in the art will be aware that the conditions for 
hybridization and/or wash may vary depending upon the nature of the hybridization 
matrix used to support the sample RNA, or the type of hybridization probe used. 
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In general, the sample or the probe is immobilized on a solid matrix or surface (e.g., 
nitrocellulose). For high throughput screening, the sample or probe will generally 
comprise an array of nucleic acids on glass or other solid matrix, such as, for example, 
as described in WO 96/17958. Techniques for producing high density arrays are 
5 described, for example, by Fodor et a/., Science 767-773, 1991, and in U.S. Pat. No. 
5,143,854. Typical protocols for other assay formats can be found, for example in 
Current Protocols In Molecular Biology, Unit 2 (Northern Blotting), Unit 4 (Southern 
Blotting), and Unit 18 (PCR Analysis), Frederick M. Ausubul etal. (ed)., 1995. 

10 The detection means according to this aspect of the invention may be any nucleic acid- 
based detection means such as, for example, nucleic acid hybridization or amplification 
reaction (eg. PCR), a nucleic acid sequence-based amplification (NASBA) system, 
inverse polymerase chain reaction (iPCR), in situ polymerase chain reaction, or reverse 
transcription polymerase chain reaction (RT-PCR), amongst others. 

15 

The probe can be labelled with a reporter molecule capable of producing an identifiable 
signal (e.g., a radioisotope such as 32 P or ^S, or a fluorescent or biotinylated molecule). 
According to this embodiment, those skilled in the art will be aware that the detection of 
said reporter molecule provides for identification of the probe and that, following the 
20 hybridization reaction, the detection of the corresponding nucleotide sequences in the 
sample is facilitated. Additional probes can be used to confirm the assay results obtained 
using a single probe. 

Wherein the detection means is an amplification reaction such as, for example, a 
25 polymerase chain reaction or a nucleic acid sequence-based amplification (NASBA) 
system or a variant thereof, one or more nucleic acid probes molecules of at least about 
20 contiguous nucleotides in length is hybridized to mRNA encoding a cancer-associated 
protein, or alternatively, hybridized to cDNA or cRNA produced from said mRNA, and 
nucleic acid copies of the template are enzymically-amplified. 

30 

Those skilled in the art will be aware that there must be a sufficiently high percentage of 
nucleotide sequence identity between the probes and the RNA sequences in the sample 
template molecule for hybridization to occur. As stated previously, the stringency 
conditions can be selected to promote hybridization. 



35 
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In one format, PCR provides for the hybridization of non-complementary probes to 
different strands of a double-stranded nucleic acid template molecule (ie. a DNA/RNA, 
RNA/RNA or DNA/DNA template), such that the hybridized probes are positioned to 
facilitate the 5'-to 3' synthesis of nucleic acid in the intervening region, under the control 
5 of a thermostable DNA polymerase enzyme. In accordance with this embodiment, one 
sense probe and one antisense probe as described herein would be used to amplify DNA 
from the hybrid RNA/DNA template or cDNA. 

In the present context, the cDNA would generally be produced by reverse transcription of 
10 mRNA present in the sample being tested (ie. RT-PCR). RT-PCR is particularly useful 
when it is desirable to determine expression of a cancer-associated gene. It is also 
known to those skilled in the art to use m RNA/DNA hybrid molecules as a template for 
such amplification reactions, and, as a consequence, first strand cDNA synthesis is all 
that is required to be performed prior to the amplification reaction. 

Variations of the embodiments described herein are described in detail by McPherson et 
a/., PCR: A Practical Approach, (series eds, D. Rickwood and B.D. Hames), IRL Press 
Limited, Oxford. pp1-253, 1991. 

20 The amplification reaction detection means described supra can be further coupled to a 
classical hybridization reaction detection means to further enhance sensitivity and 
specificity of the inventive method, such as by hybridizing the amplified DNA with a probe 
which is different from any of the probes used in the amplification reaction. 

25 Similarly, the hybridization reaction detection means described supra can be further 
coupled to a second hybridization step employing a probe which is different from the 
probe used in the first hybridization reaction. 

The comparison to be performed in accordance with the present invention may be a 
30 visual comparison of the signal generated by the probe, or alternatively, a comparison of 
data integrated from the signal, such as, for example, data that have been corrected or 
normalized to allow for variation between samples. Such comparisons can be readily 
performed by those skilled in the art. 
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Polypeptides 

Cancer-associated polypeptides are encoded by cancer-associated genes. It will be 
understood that such polypeptides include those polypeptide and fragments thereof that 
5 are homologous to the polypeptides encoded by the nucleotide sequences referred to in 
Tables 1-3, which are obtained from any source, for example related viral/bacterial 
proteins, cellular homologues and synthetic peptides, as well as variants or derivatives 
thereof. 

10 Thus, the present invention encompasses the use of variants, homologues or derivatives 
of the cancer-associated proteins descirbed in the accompanying Tables. In one 
embodiment, homologues are naturally occurring sequences, such as orthologues, tissue- 
specific isoforms and allelic variants. 

15 In the context of the present invention, a homologous sequence is taken to include an 
amino acid sequence which is at least 60, 70, 80 or 90% identical, preferably at least 95 
or 98% identical at the amino acid level over at least 20, 40, 60 or 80 amino acids with a 
sequence encoded by a nucleotide sequence referred to in any one of Tables 1-3. In 
particular, homology should typically be considered with respect to those regions of the 

20 sequence known to be essential for specific biological functions rather than non-essential 
neighbouring sequences. 

Although amino acid homology can also be considered in terms of similarity (i.e. amino 
acid residues having similar chemical properties/functions), in the context of the present 
25 invention it is preferred to express homology in terms of sequence identity. 

Homology comparisons are carried out as described above for nucleotide sequences with 
the appropriate modifications for amino acid sequences. For example when using the GCG 
Wisconsin Bestfit package (see below) the default gap penalty for amino acid sequences is 
30 -1 2 for a gap and -4 for each extension. 

It should also be noted that where computer algorithms are used to align amino acid 
sequences, although the final % homology are measured in terms of identity, the 
alignment process itself is typically not based on an all-or-nothing pair comparison. 
35 Instead, a scaled similarity score matrix is generally used that assigns scores to each 
pairwise comparison based on chemical similarity or evolutionary distance. An example 
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of such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
further details). It is preferred to use the public default values for the GCG package, or in 
5 the case of other software, the default matrix, such as BLOSUM62. 

The terms "variant" or "derivative" in relation to the amino acid sequences of the present 
invention includes any substitution of, variation of, modification of, replacement of, deletion 
of or addition of one (or more) amino acids from or to the sequence providing the resultant 
10 amino acid sequence preferably has biological activity, preferably having at least 25 to 50% 
of the activity as the polypeptides referred to in the sequence listings, more preferably at 
least substantially the same activity. Particular details of biological activity for each 
polypeptide are given in Tables 1-3. 

15 Thus, the polypeptides referred to in Tables 1-3 and homologues thereof, are modified 
for use in the present invention. Typically, modifications are made that maintain the 
activity of the sequence. Thus, in one embodiment, amino acid substitutions are made, 
for example from 1, 2 or 3 to 10, 20 or 30 substitutions provided that the modified 
sequence retains at least about 25 to 50% of, or substantially the same activity. 

20 However, in an alternative preferred embodiment, modifications to the amino acid 
sequences of a cancer-associated protein are made intentionally to reduce the biological 
activity of the polypeptide. For example truncated polypeptides that remain capable of 
binding to target molecules but lack functional effector domains are useful as inhibitors of 
the biological activity of the full length molecule. 

25 

In general, preferably less than 20%, 10% or 5% of the amino acid residues of a variant or 
derivative are altered as compared with the corresponding region of the polypeptides 
referred to in Tables 1-3. 

30 Amino acid substitutions may include the use of non-naturally occurring analogues, for 
example to increase blood plasma half-life of a therapeutically administered polypeptide 
(see below for further details on the production of peptide derivatives for use in therapy). 

Conservative substitutions are made, for example according to the Table below. Amino 
35 acids in the same block in the second column and preferably In the same line in the third 
column are substituted for each other 
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ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




H FWY 



Cancer-associated proteins also include fragments of the above mentioned full length 
polypeptides and variants thereof, including fragments of the sequences referred to in 
Tables 1-3 and homologues thereof. Preferred fragments include those which include an 
epitope. Suitable fragments will be at least about 6 or 8, e.g. at least 10, 12, 15 or 20 amino 
acids in length. They may also be less than 200, 100 or 50 amino acids in length. 
Polypeptide fragments may contain one or more (e.g. 2, 3, 5, or 10) substitutions, deletions 
or insertions, including conserved substitutions. Where substitutions, deletion and/or 
insertions have been made, for example by means of recombinant technology, preferably 
less than 20%, 10% or 5% of the amino acid residues depicted in the sequence listings are 
altered. 

Cancer-associated proteins are preferably in a substantially isolated form. It will be 
understood that the protein are mixed with carriers or diluents which will not interfere with 
the intended purpose of the protein and still be regarded as substantially isolated. A 
cancer-associated protein of the invention may also be in a substantially purified form, in 
which case it will generally comprise the protein in a preparation in which more than 
90%, e.g. 95%, 98% or 99% pure as determined by SDS/PAGE or other art-recognized 
means for asessing protein purity. 

Protein Production 

For producing full-length polypeptides or immunologically active derivatives thereof by 
recombinant means, a protein-encoding region comprising at least about 15 contiguous 
nucleotides of the protein-encoding region of a nucleic acid referred to in any one of 
Tables 1-3 is placed in operable connection with a promoter or other regulatory 
sequence capable of regulating expression in a cell-free system or cellular system. 
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Reference herein to a "promoter" is to be taken in its broadest context and includes the 
transcriptional regulatory sequences of a classical genomic gene, including the TATA 
box which is required for accurate transcription initiation, with or without a CCAAT box 
sequence and additional regulatory elements (i.e., upstream activating sequences, 
5 enhancers and silencers) which alter gene expression in response to developmental 
and/or external stimuli, or in a tissue-specific manner. In the present context, the term 
"promoter" is also used to describe a recombinant, synthetic or fusion molecule, or. 
derivative which confers, activates or enhances the expression of a nucleic acid molecule 
to which it is operably connected,, and which encodes the polypeptide or peptide 
10 fragment. Preferred promoters can contain additional copies of one or more specific 
regulatory elements to further enhance expression and/or to alter the spatial expression 
and/or temporal expression of the said nucleic acid molecule. 

Placing a nucleic acid molecule under the regulatory control of, i.e., "in operable 
15 connection with", a promoter sequence means positioning said molecule such that 
expression is controlled by the promoter sequence. Promoters are generally positioned 
5' (upstream) to the coding sequence that they control. To construct heterologous 
promoter/structural gene combinations, it is generally preferred to position the promoter 
at a distance from the gene transcription start site that is approximately the same as the 
20 distance between that promoter and the gene it controls in its natural setting, i.e., the 
gene from which the promoter is derived. Furthermore, the regulatory elements 
comprising a promoter are usually positioned within 2 kb of the start site of transcription 
of the gene. As is known in the art, some variation in this distance can be accommodated 
without loss of promoter function. Similarly, the preferred positioning of a regulatory 
25 sequence element with respect to a heterologous gene to be placed under its control is 
defined by the positioning of the element in its natural setting, i.e., the genes from which 
it is derived. Again, as is known in the art, some variation in this distance can also occur. 

The prerequisite for producing intact polypeptides and peptides in bacteria such as E 
30 coli is the use of a strong promoter with an effective ribosome binding site. Typical 
promoters suitable for expression in bacterial cells such as E coli include, but are not 
limited to, the lacz promoter, temperature-sensitive A L or A R promoters, T7 promoter or 
the IPTG-inducible tac promoter. A number of other vector systems for expressing the 
nucleic acid molecule of the invention in E coli are well-known in the art and are 
35 described, for example, in Ausubel et al (In: Current Protocols in Molecular Biology. 
Wiley Interscience, ISBN 047150338, 1987) or Sambrook et al {In: Molecular cloning. A 
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laboratory manual, second edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N.Y., 1989). Numerous plasmids with suitable promoter sequences for expression in 
bacteria and efficient ribosome binding sites have been described, such as for example, 
pKC30 (X L : Shimatake and Rosenberg, Nature 292, 128, 1981); pKK173-3 (fac: Amann 
5 and Brosius, Gene 40, 183, 1985), pET-3 (77: Studier and Moffat, J. Mol Biol 189, 113, 
1986); the pBAD/TOPO or pBAD/Thio-TOPO series of vectors containing an arabinose- 
inducible promoter (Invitrogen, Carlsbad, CA), the latter of which is designed to also 
produce fusion proteins with thioredoxin to enhance solubility of the expressed protein; 
the pFLEX series of expression vectors (Pfizer Inc., CT, USA); or the pQE series of 
10 expression vectors (Qiagen, CA), amongst others. 

Typical promoters suitable for expression in viruses of eukaryotic cells and eukaryotic 
cells include the SV40 late promoter, SV40 early promoter and cytomegalovirus (CMV) 
promoter, CMV IE (cytomegalovirus immediate early) promoter amongst others. 

15 Preferred vectors for expression in mammalian cells (eg. 293, COS, CHO, 293T cells) 
include, but are not limited to, the pcDNA vector suite supplied by Invitrogen, in particular 
pcDNA 3.1 myc-His-tag comprising the CMV promoter and encoding a C-terminal 6xHis 
and MYC tag; and the retrovirus vector pSRatkneo (Muller et a/., Mol Cell Biol, 11 \ 
1785, 1991). The vector pcDNA 3.1 myc-His (Invitrogen) is particularly preferred for 

20 expressing a secreted form of a protein in 293T cells, wherein the expressed peptide or 
protein can be purified free of conspecific proteins, using standard affinity techniques that 
employ a Nickel column to bind the protein via the His tag. 

A wide range of additional host/vector systems suitable for expressing polypeptides or 
25 immunological derivatives thereof are available publicly, and described, for example, in 
Sambrook et al {In: Molecular cloning. A laboratory manual, second edition, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989). 

Means for introducing the isolated nucleic acid molecule or a gene construct comprising 
30 same into a cell for expression are well-known to those skilled in the art. The technique 
used for a given organism depends on the known successful techniques. Means for 
introducing recombinant DNA into animal cells include microinjection, transfection 
mediated by DEAE-dextran, transfection mediated by liposomes such as by using 
lipofectamine (Gibco, MD, USA) and/or cellfectin (Gibco, MD, USA), PEG-mediated DNA 
35 uptake, electroporation and microparticle bombardment such as by using DNA-coated 
tungsten or gold particles (Agracetus Inc., Wl, USA) amongst others. 
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For producing mutants, nucleotide insertion derivatives of the protein-encoding region 
are produced by making 5' and 3' terminal fusions, or by making intra-sequence 
insertions of single or multiple nucleotides or nucleotide analogues. Insertion nucleotide 
5 sequence variants are produced by introducing one or more nucleotides or nucleotide 
analogues into a predetermined site in the nucleotide sequence of said sequence, 
although random insertion is also possible with suitable screening of the resulting product 
being performed. Deletion variants are produced by removing one or more nucleotides 
from the nucleotide sequence. Substitutional nucleotide variants are produced by 

10 substituting at least one nucleotide in the sequence with a different nucleotide or a 
nucleotide analogue in its place, with the immunologically active derivative encoded 
therefor having an identical amino acid sequence , or only a limited number of amino 
acid modifications that do not alter its antigenicity compared to the base peptide or its 
ability to bind antibodies prepared against the base peptide. Such mutant derivatives will 

15 preferably have at least 80% identity with the base amino acid sequence from which they 
are derived. 

Preferred immunologically active derivatives of a full-length polypeptide encoded by a 
gene referred to in any one of Tables 1-3 will comprise at least about 5-10 contiguous 
20 amino acids of the full-length amino acid sequence, more preferably at least about 10-20 
contiguous amino acids in length, and even more preferably 20-30 contiguous amino 
acids in length. 

For the purposes of producing derivatives using standard peptide synthesis techniques, 
25 such as, for example, Fmoc chemistry, a length not exceeding about 30-50 amino acids 
in length is preferred, as longer peptides are difficult to produce at high efficiency. 
Longer peptide fragments are readily achieved using recombinant DNA techniques 
wherein the peptide is expressed in a cell-free or cellular expression system comprising 
nucleic acid encoding the desired peptide fragment. 

30 

It will be apparent to the skilled artisan that any sufficiently antigenic region of at least 
about 5-10 amino acid residues can be used to prepare antibodies that bind generally to 
the polypeptides listed in Tables 1-3. 

35 An expressed protein or synthetic peptide is preferably produced as a recombinant fusion 
protein, such as for example, to aid in extraction and purification. To produce a fusion 
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polypeptide, the open reading frames are covalently linked in the same reading frame, 
such as, for example, using standard cloning procedures as described by Ausubel et ai 
(Current Protocols in Molecular Biology, Wiley Interscience, ISBN 047150338, 1992), 
and expressed under control of a promoter. Examples of fusion protein partners include 
5 glutathione-S-transferase (GST), FLAG, hexahistidine, GAL4 (DNA binding and/or 
transcriptional activation domains) and p-galactosidase. It may also be convenient to 
include a proteolytic cleavage site between the fusion protein partner and the protein 
sequence of interest to allow removal of fusion protein sequences. Preferably the fusion 
protein will not hinder the immune function of the target protein. 

10 

In a particularly preferred embodiment, polypeptides are produced substantially free of 
conspecific proteins. Such purity can be assessed by standard procedures, such as, for 
example, SDS/polyacrylamide gel electrophoresis, 2-dimensional gene electrophoresis, 
chromatography, amino acid composition analysis, or amino acid sequence analysis. 

15 

To produce isolated polypeptides or fragments, eg., for antibody production, standard 
protein purification techniques may be employed. For example, gel filtration, ion 
exchange chromatography, reverse phase chromatography, or affinity chromatography, 
or a combination of any one or more said procedures, may be used. High pressure and 

20 low pressure procedures can also be employed, such as, for example, FPLC, or HPLC. 
To isolate the full-length proteins or peptide fragments comprising more than about 50- 
100 amino acids in length, it is particularly preferred to express the polypeptide in a 
suitable cellular expression system in combination with a suitable affinity tag, such as a 
6xHis tag, and to purify the polypeptide using an affinity step that bonds it via the tag 

25 {supra). Optionally, the tag may then be cleaved from the expressed polypeptide. 

Alternatively, for short immunologically active derivatives of a full-length polypeptide, 
preferably those peptide fragments comprising less than about 50 amino acids in length, 
chemical synthesis techniques are conveniently used. As will be known to those skilled 
30 in the art, such techniques may . also produce contaminating peptides that are shorter 
than the desired peptide, in which case the desired peptide is conveniently purified using 
reverse phase and/or ion exchange chromatography procedures at high pressure (ie. 
HPLC or FPLC). 

35 Antibodies 
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The invention also provides monoclonal or polyclonal antibodies that bind specifically to 
polypeptides of the invention or fragments thereof. Thus, the present invention further 
provides a process for the production of monoclonal or polyclonal antibodies to polypeptides 
of the invention. 

5 

The phrase "binds specifically" to a polypeptide means that the binding of the antibody to 
the protein or peptide is determinative of the presence of the protein, in a heterogeneous 
population of proteins and other biologies. Thus, under designated immunoassay 
conditions, the specified antibodies bind to a particular protein at least two times the 
10 background and more typically more than 10 to 100 times background. Typically, 
antibodies of the invention bind to a protein of interest with a Kd of at least about 0.1 mM, 
more usually at least about 1 ^M, preferably at least about 0.1 jiM, and most preferably at 
least, 0.01 [iM. 

15 Reference herein to antibody or antibodies includes whole polyclonal and monoclonal 
antibodies, and parts thereof, either alone or conjugated with other moieties. Antibody 
parts include Fab and F(ab>2 fragments and single chain antibodies. The antibodies may 
be made in vivo in suitable laboratory animals, or, in the case of engineered antibodies 
(Single Chain Antibodies or SCABS, etc) using recombinant DNA techniques in vitro. 

20 

In accordance with this aspect of the invention, the antibodies may be produced for the 
purposes of immunizing the subject, in which case high titer or neutralizing antibodies 
that bind to a B cell epitope will be especially preferred. Suitable subjects for 
immunization will, of course, depend upon the immunizing antigen or antigenic B cell 

25 epitope. It is contemplated that the present invention will be broadly applicable to the 
immunization of a wide range of animals, such as, for example, farm animals (e.g. 
horses, cattle, sheep, pigs, goats, chickens, ducks, turkeys, and the like), laboratory 
animals (e.g. rats, mice, guinea pigs, rabbits), domestic animals (cats, dogs, birds and 
the like), feral or wild exotic animals (e.g. possums, cats, pigs, buffalo, wild dogs and the 

30 like) and humans. 

Alternatively, the antibodies may be for commercial or diagnostic purposes, in which 
case the subject to whom the diagnostic/prognostic protein or immunogenic fragment or 
epitope thereof is administered will most likely be a laboratory or farm animal. A wide 
35 range of animal species are used for the production of antisera. Typically the animal 
used for production of antisera is a rabbit, a mouse, rat, hamster, guinea pig, goat, 



WO 2004/022778 PCT/AU2003/001 166 

69 

sheep, pig, dog, horse, or chicken. Because of the relatively large blood volume of 
rabbits, a rabbit is a preferred choice for production of polyclonal antibodies. However, 
as will be known to those skilled in the art, larger amounts of immunogen are required to 
obtain high antibodies from large animals as opposed to smaller animals such as mice. 
5 In such cases, it will be desirable to isolate the antibody from the immunized animal. 

Preferably, the antibody is a high titer antibody. By "high titer" means a sufficiently high 
titer to be suitable for use in diagnostic or therapeutic applications. As will be known in 
the art, there is some variation in what might be considered "high titer". For most 
10 applications a titer of at least about 10 3 -10 4 is preferred. More preferably, the antibody 
titer will be in the range from about 10 4 to about 10 5 , even more preferably in the range 
from about 10 5 to about 10 8 . 

More preferably, in the case of B cell epitopes from pathogens, viruses or bacteria, the 
15 antibody is a neutralizing antibody (i.e. it is capable of neutralizing the infectivity of the 
organism fro which the B cell epitope is derived). 

To generate antibodies, the diagnostic/prognostic protein or immunogenic fragment or 
epitope thereof, optionally formulated with any suitable or desired carrier, adjuvant, BRM, 

20 or pharmaceutical^ acceptable excipient, is conveniently administered in the form of an 
injectable composition. Injection may be intranasal, intramuscular, sub-cutaneous, 
intravenous, intradermal, intraperitoneal, or by other known route. For intravenous 
injection, it is desirable to include one or more fluid and nutrient replenishes. Means for 
preparing and characterizing antibodies are well known in the art. (See, e.g., 

25 ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor Laboratory, 1988, 
incorporated herein by reference). 

The efficacy of the diagnostic/prognostic protein or immunogenic fragment or epitope 
thereof in producing an antibody is established by injecting an animal, for example, a 

30 mouse, rat, rabbit, guinea pig, dog, horse, cow, goat or pig, with a formulation comprising 
the diagnostic/prognostic protein or immunogenic fragment or epitope thereof, and then 
monitoring the immune response to the B cell epitope, as described in the Examples. 
Both primary and secondary immune responses are monitored. The antibody titer is 
determined using any conventional immunoassay, such as, for example, ELISA, or radio 

35 immunoassay. 
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The production of polyclonal antibodies may be monitored by sampling blood of the 
immunized animal at various points following immunization. A second, booster injection, 
may be given, if required to achieve a desired antibody titer. The process of boosting 
and titering is repeated until a suitable titer is achieved. When a desired level of 
5 immunogenicity is obtained, the immunized animal is bled and the serum isolated and 
stored, and/or the animal is used to generate monoclonal antibodies (Mabs). 

For the production of monoclonal antibodies (Mabs) any one of a number of well-known 
techniques may be used, such as, for example, the procedure exemplified in US Patent 
10 No. 4,196,265, incorporated herein by reference. 

For example, a suitable animal will be immunized with an effective amount of the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof under 
conditions sufficient to stimulate antibody producing cells. Rodents such as mice and 
15 rats are preferred animals, however, the use of rabbit, sheep, or frog cells is also 
possible. The use of rats may provide certain advantages, but mice are preferred, with 
the BALB/c mouse being most preferred as the most routinely used animal and one that 
generally gives a higher percentage of stable fusions. 

20 Following immunization, somatic cells with the potential for producing antibodies, 
specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. 
These cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a 
peripheral blood sample. Spleen cells and peripheral blood cells are preferred, the former 
because they are a rich source of antibody-producing cells that are in the dividing 

25 plasmablast stage, and the latter because peripheral blood is easily accessible. Often, a 
panel of animals will have been immunized and the spleen of animal with the highest 
antibody titer removed. Spleen lymphocytes are obtained by homogenizing the spleen 
with a syringe. Typically, a spleen from an immunized mouse contains approximately 5 x 
1 0 7 to 2 x 1 0 8 lymphocytes. 

30 

The B cells from the immunized animal are then fused with cells of an immortal myeloma 
cell, generally derived from the same species as the animal that was immunized with the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof. Myeloma cell 
lines suited for use in hybridoma-producing fusion procedures preferably are non- 
35 antibody-producing, have high fusion efficiency and enzyme deficiencies that render 
them incapable of growing in certain selective media which support the growth of only the 
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desired fused cells, or hybridomas. Any one of a number of myeloma cells may be used 
and these are known to those of skill in the art (e.g. murine P3-X63/Ag8, X63-Ag8.653, 
NS1/1.Ag 4 1, Sp210-Agi4, FO, NSO/U, MPC-11. MPC11-X45-GTG 1.7 and 
S194/5XX0; or rat R210.RCY3, Y3-Ag 1.2.3, IR983F and 4B210; and U-266, GM1500- 
GRG2, LICR-LON-HMy2 and UC729-6). A preferred murine myeloma cell is the NS-1 
myeloma cell line (also termed P3-NS-1-Ag4-1), which is readily available from the 
NIGMS Human Genetic Mutant Cell Repository under Accession No. GM3573. 
Alternatively, a murine myeloma SP2/0 non-producer cell line that is 8-azaguanine- 
resistant is used. 

To generate hybrids of antibody-producing spleen or lymph node cells and myeloma 
cells, somatic cells are mixed with myeloma cells in a proportion between about 20:1 to 
about 1:1, respectively, in the presence of an agent or agents (chemical or electrical) that 
promote the fusion of cell membranes. Fusion methods using Sendai virus have been 
described by Kohler and Milstein, Nature 256, 495-497, 1975; and Kohler arid Milstein, 
Eur. J. Immunol. 6, 511-519, 1976. Methods using polyethylene glycol (PEG), such as 
37% (v/v) PEG, are described in detail by Gefter et at, Somatic Cell Genet. 3, 231-236, 
1977. The use of electrically induced fusion methods is also appropriate. 

Hybrids are amplified by culture in a selective medium comprising an agent that blocks 
the cfe novo synthesis of nucleotides in the tissue culture media. Exemplary and 
preferred agents are aminopterin, methotrexate and azaserine. Aminopterin and 
methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the 
media is supplemented with hypoxanthine and thymidine as a source of nucleotides 
(HAT medium). Where azaserine is used, the media is supplemented with hypoxanthine. 

The preferred selection medium is HAT, because only those hybridomas capable of 
operating nucleotide salvage pathways are able to survive in HAT medium, whereas 
myeloma cells are defective in key enzymes of the salvage pathway, (e.g., hypoxanthine 
phosphoribosyl transferase or HPRT), and they cannot survive. B cells can operate this 
salvage pathway, but they have a limited life span in culture and generally die within 
about two weeks. Accordingly, the only cells that can survive in the selective media are 
those hybrids formed from myeloma and B cells. 
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The amplified hybridomas are subjected to a functional selection for antibody specificity 
and/or titer, such as, for example, by immunoassay (e.g. radioimmunoassay, enzyme 
immunoassay, cytotoxicity assay, plaque assay, dot immunobinding assay, and the like). 

5 The selected hybridomas are serially diluted and cloned into individual antibody- 
producing cell lines, which clones can then be propagated indefinitely to provide MAbs. 
The cell lines may be exploited for MAb production in two basic ways. A sample of the 
hybridoma is injected, usually in the peritoneal cavity, into a histocompatible animal of 
the type that was used to provide the somatic and myeloma cells for the original fusion. 

10 The injected animal develops tumors secreting the specific monoclonal antibody 
produced by the fused cell hybrid. The body fluids of the animal, such as serum or 
ascites fluid, can then be tapped to provide MAbs in high concentration. The individual 
cell lines could also be cultured in vitro, where the MAbs are naturally secreted into the 
culture medium from which they are readily obtained in high concentrations. MAbs 

15 produced by either means may be further purified, if desired, using filtration, 
centrifugation and various chromatographic methods such as HPLC or affinity 
chromatography. 

Monoclonal antibodies of the present invention also include anti-idiotypic antibodies 
20 produced by methods well-known in the art. Monoclonal antibodies according to the 
present invention also may be monoclonal heteroconjugates, (i.e., hybrids of two or more 
antibody molecules). In another embodiment, monoclonal antibodies according to the 
invention are chimeric monoclonal antibodies. In one approach, the chimeric monoclonal 
antibody is engineered by cloning recombinant DNA containing the promoter, leader, and 
25 variable-region sequences from a mouse anti-PSA producing cell and the constant- 
region exons from a human antibody gene. The antibody encoded by such a 
recombinant gene is a mouse-human chimera. Its antibody specificity is determined by 
the variable region derived from mouse sequences. Its isotype, which is determined by 
the constant region, is derived from human DNA. 

30 

In another embodiment, the monoclonal antibody according to the present invention is a 
"humanized" monoclonal antibody, produced by any one of a number of techniques well- 
known in the art. That is, mouse complementary determining regions ("CDRs") are 
transferred from heavy and light V-chains of the mouse Ig into a human V-domain, 
35 followed by the replacement of some human residues in the framework regions of their 
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murine counterparts. "Humanized" monoclonal antibodies in accordance with this 
invention are especially suitable for use in vivo in diagnostic and therapeutic methods. 

As stated above, the monoclonal antibodies and fragments thereof according to this 
invention are multiplied according to in vitro and in vivo methods well-known in the art. 
Multiplication in vitro is carried out in suitable culture media such as Dulbecco's modified 
Eagle medium or RPMI 1640 medium, optionally replenished by a mammalian serum 
such as fetal calf serum or trace elements and growth-sustaining supplements, e.g., 
feeder cells, such as normal mouse peritoneal exudate cells, spleen cells, bone marrow 
macrophages or the like. In vitro production provides relatively pure antibody 
preparations and allows scale-up to give large amounts of the desired antibodies. 
Techniques for large scale hybridoma cultivation under tissue culture conditions are 
known in the art and include homogenous suspension culture, (e.g., in an airlift reactor or 
in a continuous stirrer reactor or immobilized or entrapped cell culture). 

Large amounts of the monoclonal antibody of the present invention also may be obtained 
by multiplying hybridoma cells in vivo. Cell clones are injected into mammals which are 
histocompatible with the parent cells, (e.g., syngeneic mice, to cause growth of antibody- 
producing tumors. Optionally, the animals are primed with a hydrocarbon, especially oils 
such as Pristane (tetramethylpentadecane) prior to injection. 

In accordance with the present invention, fragments of the monoclonal antibody of the 
invention are obtained from monoclonal antibodies produced as described above, by 
methods which include digestion with enzymes such as pepsin or papain and/or 
cleavage of disulfide bonds by chemical reduction. Alternatively, monoclonal antibody 
fragments encompassed by the present invention are synthesized using an automated 
peptide synthesizer, or they may be produced manually using techniques well known in 
the art. 

The monoclonal conjugates of the present invention are prepared by methods known in 
the art, e.g., by reacting a monoclonal antibody prepared as described above with, for 
instance, an enzyme in the presence of a coupling agent such as glutaraldehyde or 
periodate. Conjugates with fluorescein markers are prepared in the presence of these 
coupling agents, or by reaction with an isothiocyanate. Conjugates with metal chelates 
are similarly produced. Other moieties to which antibodies may be conjugated include 
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radionuclides such as, for example, 3 H, 125 l, . 32 P, . 35 S, 14 C, 51 Cr, 36 C(, ^Co, ^Co, 59 Fe, 
75 Se, and 152 Eu. 

Radioactively labeled monoclonal antibodies of the present invention are produced 
5 according to well-known methods in the art. For instance, monoclonal antibodies are 
iodinated by contact with sodium or potassium iodide and a chemical oxidizing agent 
such as sodium hypochlorite, or an enzymatic oxidizing agent, such as lactoperoxidase. 
Monoclonal antibodies according to the invention may be labeled with technetium" by 
ligand exchange process, for example, by reducing pertechnetate with stannous solution, 
10 chelating the reduced technetium onto a Sephadex column and applying the antibody to 
this column or by direct labeling techniques, (e.g., by incubating pertechnate, a reducing 
agent such as SNCI 2 , a buffer solution such as sodium-potassium phthalate solution, and 
the antibody). 

15 Any immunoassay may be used to monitor antibody production by the 
diagnostic/prognostic protein or immunogenic fragment or epitope thereof . 
Immunoassays, in their most, simple and direct sense, are binding assays. Certain 
preferred immunoassays are the various types of enzyme linked immunosorbent assays 
(ELlSAs) and radioimmunoassays (RIA) known in the art. Immunohistochemical 

20 detection using tissue sections is also particularly useful. However, it will be readily 
appreciated that detection is not limited to such techniques, and Western blotting, dot 
blotting, FACS analyses, and the like may also be used. 

Most preferably, the assay will be capable of generating quantitative results. 

25 

For example, antibodies are tested in simple competition assays. A known antibody 
preparation that binds to the B cell epitope and the test antibody are incubated with an 
antigen composition comprising the B ceil epitope, preferably in the context of the native 
antigen. "Antigen composition" as used herein means any composition that contains 

30 some version of the B cell epitope in an accessible form. Antigen-coated wells of an 
ELISA plate are particularly preferred. In one embodiment, one would pre-mix the known 
antibodies with varying amounts of the test antibodies (e.g., 1:1, 1:10 and 1:100) for a 
period of time prior to applying to the antigen composition. If one of the known antibodies 
is labeled, direct detection of the label bound to the antigen is possible; comparison to an 

35 unmixed sample assay will determine competition by the test antibody and, hence, cross- 
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reactivity. Alternatively, using secondary antibodies specific for either the known or test 
antibody, one will be able to determine competition. 

An antibody that binds to the antigen composition will be able to effectively compete for 
5 binding of the known antibody and thus will significantly reduce binding of the latter. The 
reactivity of the known antibodies in the absence of any test antibody is the control. A 
significant reduction in reactivity in the presence of a test antibody is indicative of a test 
antibody that binds to the B cell epitope (i.e., it cross-reacts with the known antibody). 

10 In one exemplary ELISA, the antibodies against the diagnostic/prognostic protein or 
immunogenic fragment or B cell epitope are immobilized onto a selected surface 
exhibiting protein affinity, such as a well in a polystyrene microtiter plate. Then, a 
composition containing a peptide comprising the B cell epitope is added to the wells. 
After binding and washing to remove non-specifically bound immune complexes, the 

15 bound epitope may be detected. Detection is generally achieved by the addition of a 
second antibody that is known to bind to the B cell epitope and is linked to a detectable 
label. This type of ELISA is a simple "sandwich ELISA". Detection may also be achieved 
by the addition of said second antibody, followed by the addition of a third antibody that 
has binding affinity for the second antibody, with the third antibody being linked to a 

20 detectable label. 

Antibodies of the invention may be bound to a solid support and/or packaged into kits in 
a suitable container along with suitable reagents, controls, instructions and the like. 

25 Immunoassay formats 

In one embodiment, a cancer-associated protein or an immunogenic fragment or epitope 
thereof is detected in a patient sample, wherein the level of the protein or immunogenic 
fragment or epitope in the sample is indicative of ovarian cancer or disease recurrence or 
an indicator of poor survival. Preferably, the method comprises contacting a biological 

30 sample derived from the subject with an antibody capable of binding to a cancer- 
associated protein or an immunogenic fragment or epitope thereof, and detecting the 
formation of an antigen-antibody complex. 

In another embodiment, an antibody against a cancer-associated protein or epitope 
35 thereof is detected in a patient sample, wherein the level of the antibody in the sample is 
indicative of ovarian cancer or disease recurrence or an indicator of poor survival. 
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Preferably, the method comprises contacting a biological sample derived from the 
subject with a cancer-associated protein or an antigenic fragment eg., a B cell epitope or 
other immunogenic fragment thereof, and detecting the formation of an antigen-antibody 
complex. 

The diagnostic assays of the invention are useful for determining the progression of 
ovarian cancer or a metastasis thereof in a subject. In accordance with these prognostic 
applications of the invention, the levfel of a cancer-associated protein or an immunogenic 
fragment or epitope thereof in a biological sample is correlated with the disease state eg., 
as determined by clinical symptoms or biochemical tests (eg., CA125 levels). 

Accordingly, a further embodiment of the invention provides a method for detecting a 
cancer cell in a subject, said method comprising: 

(i) determining the level of a cancer-associate protein in a test sample from said 
subject; and 

(ii) comparing the level determined at (i) to the level of said cancer-associated 
protein in a comparable sample from a healthy or normal individual, 

wherein a level of said cancer-associate protein at (i) that is modified in the test sample 
relative to the comparable sample from the normal or healthy individual is indicative of 
the presence of a cancer cell in said subject. 

In one embodiment of the diagnostic/prognostic methods described herein, the biological 
sample is obtained previously from the subject. In accordance with such an 
embodiment, the prognostic or diagnostic method is performed ex vivo. 

In yet another embodiment, the subject diagnostic/prognostic methods further comprise 
processing the sample from the subject to produce a derivative or extract that comprises 
the analyte. 

Preferred detection systems contemplated herein include any known assay for detecting 
proteins or antibodies in a biological sample isolated from a human subject, such as, for 
example, SDS/PAGE, isoelectric focussing, 2-dimensional gel electrophoresis 
comprising SDS/PAGE and isoelectric focussing, an immunoassay, a detection based 
system using an antibody or non-antibody ligand of the protein, such as, for example, a 
small mplecule (e.g. a chemical compound, agonist, antagonist, allosteric modulator, 
competitive inhibitor, or non-competitive inhibitor, of the protein). In accordance with 
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these embodiments, the antibody or small molecule may be used in any standard solid 
phase or solution phase assay format amenable to the detection of proteins. Optical or 
fluorescent detection, such as, for example, using mass spectrometry, MALDI-TOF, 
biosensor technology, evanescent fiber optics, or fluorescence resonance energy 
5 transfer, is clearly encompassed by the present invention. Assay systems suitable for 
use in high throughput screening of mass samples, particularly a high throughput 
spectroscopy resonance, method (e.g. MALDI-TOF, electrospray MS or nano- 
electrospray MS), are particularly contemplated. 

10 Immunoassay formats are particularly preferred, eg., selected from the group consisting 
of, an immunoblot, a Western blot, a dot blot, an enzyme linked immunosorbent assay 
(ELISA), radioimmunoassay (RIA), enzyme immunoassay. Modified immunoassays 
utilizing fluorescence resonance energy transfer (FRET), isotope-coded affinity tags 
(ICAT), matrix-assisted laser desorption/ionization time of flight (MALDI-TOF), 

15 electrospray ionization (ESI), biosensor technology, evanescent fiber-optics technology 
or protein chip technology are also useful. 

Preferably, the assay is a semi-quantitative assay or quantitative assay. 

20 Standard solid phase ELISA formats are particularly useful in determining the 
concentration of a protein or antibody from a variety of patient samples. 

In one form such as an assay involves immobilising a biological sample comprising 
antibodies against the cancer-associated protein or epitope, or alternatively an ovarian 
25 cancer-associated protein or an immunogenic fragment thereof, onto a solid matrix, such 
as, for example a polystyrene or polycarbonate microwell or dipstick, a membrane, or a 
glass support (e.g. a glass slide). 

In the case of an antigen-based assay, an antibody that specifically binds an ovarian 
30 cancer-associated protein is brought into direct contact with the immobilised biological 
sample, and forms a direct bond with any of its target protein present in said sample. For 
an antibody-based assay, an immobilized ovarian cancer-associated protein or an 
immunogenic fragment or epitope thereof is contacted with the sample. The added 
antibody or protein in solution is generally labelled with a detectable reporter molecule, 
35 such as for example, a fluorescent label (e.g. FITC or Texas Red) or an enzyme (e.g. 
horseradish peroxidase (HRP)), alkaline phosphatase (AP) or p-galactosidase. 
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Alternatively, or in addition, a second labelled antibody can be used that binds to the first 
antibody or to the isolated/recombinant antigen. Following washing to remove any 
unbound antibody or antigen, as appropriate, the label is detected either directly, in the 
case of a fluorescent label, or through the addition of a substrate, such as for example 
5 hydrogen peroxide, TMB, or toluidine, or 5-bromo-4-chloro-3-indol-beta-D- 
galaotopyranoside (x-gal). 

Such ELISA based systems are particularly suitable for quantification of the amount of a 
protein or antibody in a sample, such as, for example, by calibrating the detection system 
10 against known amounts of a standard. 

In another form, an ELISA consists of immobilizing an antibody that specifically binds an 
ovarian cancer-associated protein on a solid matrix, such as, for example, a membrane, 
a polystyrene or polycarbonate microwell, a polystyrene or polycarbonate dipstick or a 
15 glass support. A patient sample is then brought into physical relation with said antibody, 
and the antigen in the sample is bound or 'captured'. The bound protein can then be 
detected using a labelled antibody. For example if the protein is captured from a human 
sample, an anti-human antibody is used to detect the captured protein. Alternatively, a 
third labelled antibody can be used that binds the second (detecting) antibody. 

20 

It will be apparent to the skilled person that the assay formats described herein are 
amenable to high throughput formats, such as, for example automation of screening 
processes, or a microarray format as described in Mendoza et a/, Biotechnigues 27(4): 
778-788, 1999. Furthermore, variations of the above described assay will be apparent to 
25 those skilled in the art, such as, for example, a competitive ELISA. 

Alternatively, the presence of antibodies against the cancer-associate protein, or 
alternatively an oarian cancer-associated protein or an immunogenic fragment thereof, is 
detected using a radioimmunoassay .(RIA). The basic principle of the assay is the use of 

30 a radiolabeled antibody or antigen to detect antibody antigen interactions. For example, 
an antibody that specifically binds to an ovarian cancer-associated protein can be bound 
to a solid support and a biological sample brought into direct contact with said antibody. 
To detect the bound antigen, an isolated and/or recombinant form of the antigen is 
radiolabeled is brought into contact with the same antibody. Following washing the 

35 amount of bound radioactivity is detected. As any antigen in the biological sample inhibits 
binding of the radiolabeled antigen the amount of radioactivity detected is inversely 
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proportional to the amount of antigen in the sample. Such an assay may be quantitated 
by using a standard curve using increasing known concentrations of the isolated antigen. 

As will be apparent to the skilled artisan, such an assay may be modified to use any 
5 reporter molecule, such as, for example, an enzyme or a fluorescent molecule, in place 
of a radioactive label. 

Western blotting is also useful for detecting an ovarian cancer-associated protein or an 
immunogenic fragment thereof. In such an assay protein from a biological sample is 

10 separated using sodium dodecyl sulphate (SDS) polyacrylamide gel electrophoresis 
(SDS-PAGE) using techniques well known in the art and described in, for example, 
Scopes (Iqi Protein Purification: Principles and Practice, Third Edition, Springer Verlag, 
1994). Separated proteins are then transferred to a solid support, such as, for example, 
a membrane or more specifically PVDF membrane, using methods well known in the art, 

15 for example, electrotransfer. This membrane may then be blocked and probed with a 
labelled antibody or ligand. that specifically binds an ovarian cancer-associated protein. 
Alternatively, a labelled secondary, or even tertiary, antibody or ligand can be used to 
detect the binding of a specific primary antibody. 

20 High-throughput methods for detecting the presence or absence of antibodies, or 
alternatively ovarian cancer-associated protein or an immunogenic fragment thereof are 
particularly preferred. 

In one embodiment, MALDI-TOF is used for the rapid identification of a protein. 

25 Accordingly, there is no need to detectthe proteins of interest using an antibody or ligand 
that specifically binds to the protein of interest. Rather, proteins from a biological sample 
are separated using gel electrophoresis using methods well known in the art and those 
proteins at approximately the correct molecular weight and/or isoelectric point are 
analysed using MALDI-TOF to determine the presence or absence of a protein of 

30 interest. 

Alternatively, MALDI or ESI or a combination of approaches is used to determine the 
concentration of a particular protein in a biological sample, such as, for example sputum. 
Such proteins are preferably well characterised previously with regard to parameters 
35 such as molecular weight and isoelectric point. 
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Biosensor devices generally employ an electrode surface in combination with current or 
impedance measuring elements to be integrated into a device in combination with the 
assay substrate (such as that described in U.S. Patent No. 5,567,301). An antibody or 
ligand that specifically binds to a protein of interest is preferably incorporated onto the 
surface of a biosensor device and a biological sample isolated from a patient (for 
example sputum that has been solubilised using the methods described herein) 
contacted to said device. A change in the detected current or impedance by the 
biosensor device indicates protein binding to said antibody or ligand. Some forms of 
biosensors known in the art also rely on surface plasmon resonance to detect protein 
interactions, whereby a change in the surface plasmon resonance surface of reflection is 
indicative of a protein binding to a ligand or antibody (U.S. Patent No. 5,485,277 and 
5,492,840). 

Biosensors are of particular use in high throughput analysis due to the ease of adapting 
such systems to micro- or nano-scales. Furthermore, such systems are conveniently 
adapted to incorporate several detection reagents, allowing for multiplexing of diagnostic 
reagents in a single biosensor unit. This permits the simultaneous detection of several 
epitopes in a small amount of body fluids. 

Evanescent biosensors are also preferred as they do not require the pretreatment of a 
biological sample prior to detection of a protein of interest. An evanescent biosensor 
generally relies upon light of a predetermined wavelength interacting with a fluorescent 
molecule, such as for example, a fluorescent antibody attached near the probe's surface, 
to emit fluorescence at a different wavelength upon binding of the diagnostic protein to 
the antibody or ligand. 

To produce protein chips, the proteins, peptides, polypeptides, antibodies or ligands that 
are able to bind specific antibodies or proteins of interest are bound to a solid support 
such as for example, glass, polycarbonate, polytetrafluoroethylene, polystyrene, silicon 
oxide, metal or silicon nitride. This immobilization is either direct (e.g. by covalent 
linkage, such as, for example, Schiffs base formation, disulfide linkage, or amide or urea 
bond formation) or indirect. Methods of generating a protein chip are known in the art and 
are described in for example U.S. Patent Application No. 20020136821, 20020192654, 
20020102617 and U.S. Patent No. 6,391,625. In order to bind a protein to a solid support 
it is often necessary to treat the solid support so as to create chemically reactive groups 
on the surface, such as, for example, with an aldehyde-containing silane reagent. 
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Alternatively, an antibody or ligand may be captured on a microfabricated polyacrylamide 
gel pad and accelerated into the gel using microelectrophoresis as described in, Arenkov 
et a/. Anal. Biochem. 278: 1 23-1 31 , 2000. 

5 A protein chip is preferably generated such that several proteins, ligands or antibodies 
are arrayed on said chip. This format permits the simultaneous screening for the 
presence of several proteins in a sample. 

Alternatively, a protein chip may comprise only one protein, ligand or antibody, and be 
10 used to screen one or more patient samples for the presence of one polypeptide of 
interest Such a chip may also be used to simultaneously screen an array of patient 
samples for a polypeptide of interest. 

Preferably, a sample to be analysed using a protein chip is attached to a reporter 
15 molecule, such as, for example, a fluorescent molecule, a radioactive molecule, an 
enzyme, or an antibody that is detectable using methods well known in the art. 
Accordingly, by contacting a protein chip with a labelled sample and subsequent washing 
to remove any unbound proteins the presence of a bound protein is detected using 
methods well known in the art, such as, for example using a DNA microarray reader. 

20 

Alternatively, biomolecular interaction analysis-mass spectrometry (BIA-MS) is used to 
rapidly detect and characterise a protein present in complex biological samples at the 
low- to sub-fmole level (Nelson et al Bectrophoresis 21: 1155-1163, 2000). One 
technique useful in the analysis of a protein chip is surface enhanced laser 
25 desorption/ionization-time of flight-mass spectrometry (SELDI-TOF-MS) technology to 
characterise a protein bound to the protein chip. Alternatively, the protein chip is 
analysed using ESI as described in U.S. Patent Application 20020139751. 

As will be apparent to the skilled artisan, protein chips are particularly amenable to 
30 multiplexing of detection reagents. Accordingly, several antibodies or ligands each able 
to specifically bind a different peptide or protein may be bound to different regions of said 
protein chip. Analysis of a biological sample using said chip then permits the detecting of 
multiple proteins of interest, or multiple B cell epitopes of the ovarian cancer-associated 
protein. Multiplexing of diagnostic and prognostic markers is particularly contemplated in 
35 the present invention. 
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In a further embodiment, the samples are analysed using ICAT, essentially as described 
in US Patent Application No. 20020076739. This system relies upon the labelling of a 
protein sample from one source (i.e. a healthy individual) with a reagent and the labelling 
of a protein sample from another source (i.e. a tuberculosis patient) with a second 
5 reagent that is chemically identical to the first reagent, but differs in mass due to isotope 
composition. It is preferable that the first and second reagents also comprise a biotin 
molecule. Equal concentrations of the two samples are then mixed, and peptides 
recovered by avidin affinity chromatography. Samples are then analysed using mass 
spectrometry. Any difference in peak heights between the heavy and light peptide ions 
10 directly correlates with a difference in protein abundance in a biological sample. The 
identity of such proteins may then be determined using a method well known in the art, 
such as, for example MALDI-TOF, or ESI. 

As will be apparent to those skilled in the art a diagnostic or prognostic assay described 
15 herein may be a multiplexed assay. As used herein the term "multiplex", shall be 
understood not only to mean the detection of two or more diagnostic or prognostic 
markers in a single sample simultaneously, but also to encompass consecutive detection 
of two or more diagnostic or prognostic markers in a single sample, simultaneous 
detection of two or more diagnostic or prognostic markers in distinct but matched 
20 samples, and consecutive detection of two or more diagnostic or prognostic markers in 
distinct but matched samples. As used herein the term "matched samples" shall be 
understood to mean two or more samples derived from the same initial biological sample, 
or two or more biological samples isolated at the same point in time. 

25 Accordingly, a multiplexed assay may comprise an assay that detects several antibodies 
and/or epitopes in the same reaction and simultaneously, or alternatively, it may detect 
other one or more antigens/antibodies in addition to one or more antibodies and/or 
epitopes. As will be apparent to the skilled artisan, if such an assay is antibody or ligand 
based, both of these antibodies must function under the same conditions. 

30 

Diagnostic assay kits 

A further aspect of the present invention provides a kit for detecting M. tuberculosis 
infection in a biological sample. In one embodiment, the kit comprises: 

(i) one or more isolated antibodies that bind to an ovarian cancer-associated protein 
35 or an immunogenic fragment or epitope thereof; and 

(ii) means for detecting the formation of an antigen-antibody complex. 
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In an alternative embodiment, the kit comprises: 

(i) an isolated or recombinant ovarian cancer-associated protein or an immunogenic 
fragment or epitope thereof; and 
5 (ii) means for detecting the formation of an antigen-antibody complex. 

Optionally, the kit further comprises means for the detection of the binding of an 
antibody, fragment thereof or a ligand to an ovarian cancer-associated protein. Such 
means include a reporter molecule such as, for example, an enzyme (such as 
10 horseradish peroxidase or alkaline phosphatase), a substrate, a cofactor, an inhibitor, a 
dye, a radionucleotide, a luminescent group, a fluorescent group, biotin or a colloidal 
particle, such as colloidal gold or selenium. Preferably such a reporter molecule is 
directly linked to the antibody or ligand. 

15 In yet another embodiment, a kit may additionally comprise a reference sample. Such a 
reference sample. 

In another embodiment, a reference sample comprises a peptide that is detected by an 
antibody or a ligand. Preferably, the peptide is of known concentration. Such a peptide 
20 is of particular use as a standard. Accordingly various known concentrations of such a 
peptide may be detected using a prognostic or diagnostic assay described herein. 

In yet another embodiment, a kit comprises means for protein isolation (Scopes (Arr 
Protein Purification: Principles and Practice, Third Edition, Springer Verlag, 1994). 

25 

Bioinformatics 

The ability to identify genes that are over or under expressed in ovarian cancer can 
additionally provide high-resolution, high-sensitivity datasets which are used in the areas 
of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

30 biosensor development, and other related areas. For example, the expression profiles 
are used in diagnostic or prognostic evaluation of patients with ovarian cancer. Or as 
another example, subcellular toxicological information are generated to better direct drug 
structure and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, 
Mechanism, and Function, paper presented at the IBC Proteomics conference, 

35 Coronado, CA (June 11-12, 1998)). Subcellular toxicological information can also be 
. utilized in a biological sensor device to predict the likely toxicological effect of chemical 
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exposures and likely tolerable exposure thresholds (see U.S. Patent No. 5,811,231).- 
Similar advantages accrue from datasets relevant to other biomolecules and bioactive 
agents (e.g., nucleic acids, saccharides, lipids, drugs, and the like). 

5 Thus, in another embodiment, the present invention provides a database that includes at 
least one set of assay data. The data contained in the database is acquired, e.g., using 
array analysis either singly or in a library format. The database are in substantially any 
form in which data are maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention are maintained on any electronic 
10 device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area network, such as the World Wide 
Web. 

The focus of the present section on databases that include peptide sequence data is for 
15 clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases are assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a 

20 biological sample undergoing ovarian cancer, i.e., the identification of ovarian cancer- 
associated sequences described herein, provide an abundance of information, which are 
correlated with pathological conditions, predisposition to disease, drug testing, 
therapeutic monitoring, gene-disease causal linkages, identification of correlates of 
immunity and physiological status, among others. Although the data generated from the 

25 assays of the invention is suited for manual review and analysis, in a preferred 
embodiment, prior data processing using high-speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is known in the 
art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 

30 system for storing biomolecular sequence information in a manner that allows sequences 
to be catalogued and searched according to one or more protein function hierarchies. 
U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA 
sequences to be catalogued and searched according to association with one or more 

35 sequencing projects for obtaining full-length sequences from the collection of partial 
length sequences. U.S. Patent 5,706,498 discloses a gene database retrieval system for 
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database based on the degree of similarity between a key sequence and a target 
sequence. U.S. Patent 5,538,897 discloses a method using mass spectroscopy 
fragmentation patterns of peptides to identify amino acid sequences in computer 
5 databases by comparison of predicted mass spectra with experimentally-derived mass 
spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. 
10 U.S. Patent 5,295,261 reports a hybrid database structure in which the fields of each 
database record are divided into two classes, navigational and informational data, with 
navigational fields stored in a hierarchical topological map which are viewed as a tree 
structure or as the merger of two or more such tree structures. 

15 See also Mount et al., Bioinformatics (2001); Biological Sequence Analysis: Probabilistic 
Models of Proteins and Nucleic Acids (Durbin et ai, eds., 1999); Bioiraformatics: A 
Practical Guide to the Analysis of Genes and Proteins (Baxevanis & Oeullette eds., 
1998)); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological Science and 
Medicine (1999); Introduction to Computational Molecular Biology (Setubal et ai, eds 

20 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds, 2000); 
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins & 
Taylor, eds., 2000); Brown, Bioinfor7natics: A Biologists Guide to Biocomputing and the 
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and 
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes 

25 (1995). 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, 
e.g., with data specifying the source of the target-containing sample from which each 
30 sequence specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample is 
from a control tissue sample known to be free of pathological disorders, in a variation, at 
least one of the sources is a known pathological tissue specimen, e.g., a neoplastic 
35 lesion or another tissue specimen to be analyzed for prostate cancer. In another 
variation, the assay records cross-tabulate one or more of the following parameters for 
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each target species in a sample: (1) a unique identification code, which can include, e.g., 
a target molecular structure, and/or characteristic separation coordinate (e.g., 
electrophoretic coordinates); (2) sample source; and (3) absolute and/or relative quantity 
of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data in a 
computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and 
on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge 
states or transistor gate states, such as an array of cells in a DRAM device (e.g., each 
cell comprised of a transistor and a charge storage area, which are on the transistor). In 
one embodiment, the invention provides such storage devices, and computer systems 
built therewith, comprising a bit pattern encoding a protein expression fingerprint record 
comprising unique identifiers for at least 10 target data records cross-tabulated with 
target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a method 
for identifying related peptide or nucleic acid sequences, comprising performing a 
computerised comparison between a peptide or nucleic acid sequence assay record 
stored in or retrieved from a computer storage device or database and at least one other 
sequence. The comparison can include a sequence analysis or comparison algorithm or 
computer program embodiment thereof (e.g., BLAST, FASTA, TFASTA, GAP, BESTFIT 
see above) and/or the comparison are of the relative amount of a peptide or nucleic 
acid sequence in a pool of sequences determined from a polypeptide or nucleic acid 
sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/.98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard 
(fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized 
sequence analysis, comparison, or relative quantitation method. 
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The invention also provides a network, comprising a plurality of computing devices linked 
via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN line, 
wireless network, optical fiber, or other suitable signal transmission medium, whereby at 
least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
5 magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 
cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a 
10 modem, ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the 
signal includes (in native or encrypted format) a bit pattern encoding data from an assay 
or a database comprising a plurality of assay results obtained by the method of the 
invention. 

15 In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 
obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 
initialized to load and execute the computer program for alignment and/or comparison of 

20 the assay results. Data for a query target is entered into the central processor via an I/O 
device. Execution of the computer program results in the central processor retrieving the 
assay data from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program are transferred to secondary 
25 memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a 
selected assay characteristic (e.g., binding to a selected affinity moiety) and the same 
characteristic of the query target and results are output via an I/O device. For example, a 
central processor are a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 
30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program are a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file are an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory 
etc.); an I/O device are a terminal comprising a video display and a keyboard, a modem, 
35 an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip 
reader, or other suitable I/O device. 



WO 2004/022778 



PCT/AU2003/001166 



88 



The invention also preferably provides the use of a computer system, such as that 
described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
collection of peptide sequence specificity records obtained by the methods of the 
5 Invention, which are stored in the computer; (3) a comparison target, such as a query 
target; and (4) a program for alignment and comparison, typically with rank-ordering of 
comparison results on the basis of computed similarity values. 

Transgenic Animals Expressing Ovarian Cancer-associated proteins and "Knock-Out" 
10 Animals 

The present invention also contemplates transgenic animals which are transgenic by 
virtue of comprising a polynucleotide of the invention, i.e. animals transformed with a 
cancer-associated gene of the invention. Suitable animals are generally from the phylum 
chordata. Chordates includes vertebrate groups such as mammals, birds, reptiles and 
15 amphibians. Particular examples of mammals include non-human primates, cats, dogs, 
ungulates such as cows, goats, pigs, sheep and horses and rodents such as mice, rats, 
gerbils and hamsters. Transgenic animals within the meaning of the present invention 
are non-human animals and the production of transgenic humans is specifically 
excluded. 

20 

Techniques for producing transgenic animals are well known in the art. A useful general 
textbook on this subject is Houdebine, Transgenic animals - Generation and Use 
(Harwood Academic, 1997) - an extensive review of the techniques used to generate 
transgenic animals from fish to mice and cows. 

25 

Advances in technologies for embryo micromanipulation now permit introduction of 
heterologous DNA into, for example, fertilized mammalian ova. For instance, totipotent or 
pluripotent stem cells are transformed by microinjection, calcium phosphate mediated 
precipitation, liposome fusion, retroviral infection or other means, the transformed cells 

30 are then introduced into, the embryo, and the embryo then develops into a transgenic 
animal. In a highly preferred method, developing embryos are infected with a retrovirus 
containing the desired DNA, and transgenic animals produced from the infected embryo. 
In a most preferred method, however, the appropriate DNAs are coinjected into the 
pronucleus or cytoplasm of embryos, preferably at the single cell stage, and the embryos 

35 allowed to develop into mature transgenic animals. Those techniques as well known. See 
reviews of standard laboratory procedures for microinjection of heterologous DNAs into 
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mammalian fertilized ova, including Hogan etal., Manipulating the Mouse Embryo, (Cold 
Spring Harbor Press 1986); Krimpenfort et a/., Bio/Technology 9:844 (1991); Palmiter et 
a/., Cell, 41: 343 (1985); Kraemer etal., Genetic manipulation of the Mammalian Embryo, 
(Cold Spring Harbor Laboratory Press 1985); Hammer et a/., Nature, 315: 680 (1985); 
5 Wagner et a/., U.S. Pat. No. 5,175,385; Krimpenfort et a/., U.S. Pat. No. 5,175,384, the 
respective contents of which are incorporated herein by reference 

Another method used to produce a transgenic animal involves microinjecting a nucleic 
acid into pro-nuclear stage eggs by standard methods. Injected eggs are then cultured 
10 before transfer into the oviducts of pseudopregnant recipients. 

Transgenic animals may also be produced by nuclear transfer technology as described in 
Schnieke, A.E. et a/., 1997, Science, 278: 2130 and Cibelli, J.B. et a/., 1998, Science, 
280: 1256. Using this method, fibroblasts from donor animals are stably transfected with 
15 a plasmid incorporating the coding sequences for a binding domain or binding partner of 
interest under the control of regulatory. Stable transfectants are then fused to enucleated 
oocytes, cultured and transferred into female recipients. 

Analysis of animals which may contain transgenic sequences would typically be 
20 performed by either PGR or Southern blot analysis following standard methods. 

By way of a specific example for the construction of transgenic mammals, such as cows, 
nucleotide constructs comprising a sequence encoding a binding domain fused to GFP 
are microinjected using, for example, the technique described in U.S. Pat. No. 4,873,191, 
25 into oocytes which are obtained from ovaries freshly removed from the mammal. The 
oocytes are aspirated from the follicles and allowed to settle before fertilization with 
thawed frozen sperm capacitated with heparin and prefractionated by Percoll gradient to 
isolate the motile fraction. 

30 The fertilized oocytes are centrifuged, for example, for eight minutes at 15,000 g to 
visualize the pronuclei for injection and then cultured from the zygote to morula or 
blastocyst stage in oviduct tissue-conditioned medium. This medium is prepared by using 
luminal tissues scraped from oviducts and diluted in culture medium. The zygotes must 
be placed in the culture medium within two hours following microinjection. 
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Oestrous is then synchronized in the intended recipient mammals, such as cattle, by 
administering coprostanol. Oestrous is produced within two days and the embryos are 
transferred to the recipients 5-7 days after estrous. Successful transfer are evaluated in 
the offspring by Southern blot. 

5 

Alternatively, the desired constructs are introduced into embryonic stem cells (ES cells) 
and the cells cultured to ensure modification by the transgene. The modified cells are 
then injected into the blastula embryonic stage and the blastulas replaced into 
pseudopregnant hosts. The resulting offspring are chimeric with respect to the ES and 
10 host cells, and nonchimeric strains which exclusively comprise the ES progeny are 
obtained using conventional cross-breeding. This technique is described, for example, in 
WO91/10741. 

In another embodiment, transgenic animals of the present invention are transgenic 
15 "knock-out" animals where a specific gene corresponding to a polynucleotide referred to 
in Tables 1-3 has been rendered non-functional by homologous recombination. The 
generation of "knock-out" animals is similar to the production of other transgenic animals 
except that the polynucleotide constructs are designed to integrate into the endogenous 
genes and disrupt the function of the endogenous sequences. The generation of "knock- 
20 ouf animals is known in the art, including the design of suitable constructs that will 
recombine at the appropriate site in the genome. 

In one embodiment, the heterologous sequence which it is desired to recombine into the 
genome of a target animal comprises a functional sequence but under the control of an 
25 inducible promoter so that expression of the gene are regulated by administration of an 
endogenous molecule. This are advantageous where disruption of the gene is 
embryonic-lethal. 

"Knock-out" animals are used as animal models for the study of gene function. 

30 

Therapeutic peptides 

In accordance with this embodiment, ovarian cancer-associated proteins of the present 
invention are administered therapeutically to patients for a time and under conditions 
sufficient to ameliorate the growth of a tumor in the subject or to prevent tumor 
35 recurrence. 
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It is preferred to use peptides that do not consisting solely of naturally-occurring amino 
acids but which have been modified, for example to reduce immunogenicity, to increase 
circulatory half-life in the body of the patient, to enhance bioavailability and/or to enhance 
efficacy and/or specificity. 

A number of approaches have been used to modify peptides for therapeutic application. 
One approach is to link the peptides or proteins to a variety of polymers, such as 
polyethylene glycol (PEG) and polypropylene glycol (PPG) - see for example U.S. Patent 
Nos. 5,091,176, 5,214,131 and US 5,264,209. 

Replacement of naturally-occurring amino acids with a variety of uncoded or modified 
amino acids such as D-amino acids and N-methyl amino acids may also be used to 
modify peptides 

Another approach is to use Afunctional crosslinkers, such as N-succinimidyl 3-(2 
pyridyldithio) propionate, succinimidyl 6-[3-(2 pyridyldithio) propionamido] hexanoate, and 
sulfosuccinimidyl 6-[3-(2 pyridyldithio) propionamidojhexanoate (see US Patent 
5,580,853). 

It are desirable to use derivatives of the ovarian cancer-associated proteins of the 
invention which are conformationally constrained. Conformational constraint refers to the 
stability and preferred conformation of the three-dimensional shape assumed by a 
peptide. Conformational constraints include local constraints, involving restricting the 
conformational mobility of a single residue in a peptide; regional constraints, involving 
restricting the conformational mobility of a group of residues, which residues may form 
some secondary structural unit; and global constraints, involving the entire peptide 
structure. 

The active conformation of the peptide are stabilized by a covalent modification, such as 
cyclization or by incorporation of gamma-lactam or other types of bridges. For example, 
side chains are cyclized to the backbone so as create a L-gamma-lactam moiety on each 
side of the interaction site. See, generally, Hruby et al., "Applications of Synthetic 
Peptides," in Synthetic Peptides: A User's Guide: 259-345 (W. H. Freeman & Co. 1992). 
Cyclization also are achieved, for example, by formation of cystine bridges, coupling of 
amino and carboxy terminal groups of respective terminal amino acids, or coupling of the 
amino group of a Lys residue or a related homolog with a carboxy group of Asp, Glu or a 
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related homolog. Coupling of the .alpha-amino group of a polypeptide with the epsilon- 
amino group of a lysine residue, using iodoacetic anhydride, are also undertaken. See 
Wood and Wetzel, 1992, Infl J. Peptide Protein Res. 39: 533-39. 

5 Another approach described in US 5,891,418 is to include a metal-ion complexing 
backbone in the peptide structure. Typically, the preferred metal-peptide backbone is 
based on the requisite number of particular coordinating groups required by the 
coordination sphere of a given complexing metal ion. In general, most of the metal ions 
that may prove useful have a coordination number of four to six. The nature of the 

10 coordinating groups in the peptide chain includes nitrogen atoms with amine, amide, 
imidazole, or guanidino functionalities; sulfur atoms of thiols or disulfides; and oxygen 
atoms of hydroxy, phenolic, carbonyl,' or carboxyl functionalities. In addition, the peptide 
chain or individual amino acids are chemically altered to include a coordinating group, 
such as for example oxime, hydrazino, sulfhydryl, phosphate, cyano, pyridino, piperidino, 

15 or morpholino. The peptide construct are either linear or cyclic, however a linear 
construct is typically preferred. One example of a small linear peptide is Gly-Gly-Gly-Gly 
which has four nitrogens (an N 4 complexation system) in the back bone that can complex 
to a metal ion with a coordination number of four. 

20 A further technique for improving the properties of therapeutic peptides is to use 
non-peptide peptidomimetics. A wide variety of useful techniques are used to elucidating 
the precise structure of a peptide. These techniques include amino acid sequencing, x- 
ray crystallography, mass spectroscopy, nuclear magnetic resonance spectroscopy, 
computer-assisted molecular modeling, peptide mapping, and combinations thereof. 

25 Structural analysis of a peptide generally provides a large body of data which comprise 
the amino acid sequence of the peptide as well as the three-dimensional positioning of its 
atomic components. From this information, non-peptide peptidomimetics are designed 
that have the required chemical functionalities for therapeutic activity but are more stable, 
for example less susceptible to biological degradation. An example of this approach is 

30 provided in US 5,81 1 ,512. . 

Techniques for chemically synthesising therapeutic peptides of the invention are 
described in the above references and also reviewed by Borgia and Fields, 2000, 
TibTech 18: 243-251 and described in detail in the references contained therein. 

35 

Assays for therapeutic compounds 
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The ovarian cancer, proteins, nucleic acids, and antibodies as described herein are used 
in drug screening assays to identify candidate compounds for use in treating ovarian 
cancer. The ovarian cancer-associated proteins, antibodies, nucleic acids, modified 
proteins and cells containing ovarian cancer sequences are used in drug screening 
5 assays or by evaluating the effect of drug candidates on a "gene expression profile" or 
expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
Zlokarnik, etaL, 1998, Science 279: 84-88); Heid, 1996, Genome Res 6: 986-94). 

10 

In a preferred embodiment, the ovarian cancer-associated proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified ovarian cancer- 
associated proteins are used in screening assays. That is, the present invention provides 
methods for screening for compounds/agents which modulate the ovarian cancer 

15 phenotype or an identified physiological function of a ovarian cancer-associated protein. 
As above, this are done on an individual gene level or by evaluating the effect of drug 
candidates on a "gene expression profile". In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to 
allow monitoring for expression profile genes after treatment with a candidate agent, see 

20 Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays are 
executed. In a preferred embodiment, assays are run on an individual gene or protein 
level. That is, having identified a particular gene as up regulated in ovarian cancer, test 

25 compounds are screened for the ability to modulate gene expression or for binding to the 
ovarian cancer-associated protein. "Modulation" thus includes both an increase and a 
decrease in gene expression. The preferred amount of modulation will depend on the 
original change of the gene expression in normal versus tissue undergoing ovarian 
cancer, with changes of at least 10%, preferably 50%, more preferably 100-300%, and in 

30 some embodiments 300-1000% or greater. Thus, if a gene exhibits a 4-fold increase in 
ovarian cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, a 10-fold decrease in ovarian cancer tissue compared to normal tissue 
often provides a target value of a 10-fold increase in expression to be induced by the test 
compound. 

35 
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The amount of gene expression are monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself are 
monitored, e.g., through the use of antibodies to the ovarian cancer-associated protein 
and standard immunoassays. Proteomics and separation techniques may also allow 
5 quantification of expression. . 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically involve a plurality of those entities described herein. 

10 

In this embodiment, the ovarian cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of ovarian cancer sequences in a 
particular cell. Alternatively, PCR are used. Thus, a series are used with dispensed 
primers in desired wells. A PCR reaction can then be performed and analyzed for each 
15 well. 

Expression monitoring are performed to identify compounds that modify the expression 
of one or more ovarian cancer-associated sequences, e.g., a polynucleotide sequence 
set out in Tables 1-3. In a preferred embodiment, a test modulator is added to the cells 
20 prior to analysis. Moreover, screens are also provided to identify agents that modulate 
ovarian cancer, modulate ovarian cancer-associated proteins, bind to a ovarian cancer- 
associated protein, or interfere with the binding of a ovarian cancer-associated protein 
and an antibody or other binding partner. 

25 The term "test compound" or "drug candidate" or "modulator or grammatical equivalents 
as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the ovarian cancer phenotype or the expression of a ovarian cancer 
sequence, e.g., a nucleic acid or protein sequence. In preferred embodiments, 

30 modulators alter expression profiles, or expression profile nucleic acids or proteins 
provided herein. In one embodiment, the modulator suppresses a ovarian cancer 
phenotype, e.g. to a normal tissue fingerprint. In another embodiment, a modulator 
induced a ovarian cancer phenotype. Generally, a plurality of assay mixtures are run in 
parallel with different agent concentrations to obtain a differential response to the various 

35 concentrations. Typically, one of these concentrations serves as a negative control, i.e., 
at zero concentration or below the level of detection. 
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. Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of 
more than 100 and less than about 2,500 daltons. Preferred small molecules are less 
5 than 2000, or less than 1500 or less than 1000 or less than 500 Daltons. Candidate 
agents comprise functional groups necessary for structural interaction with proteins, 
particularly hydrogen bonding, and typically include at least an amine, carbonyl, hydroxyl 
or carboxyl group, preferably at least two of the functional chemical groups. The 
candidate agents often comprise cyclical carbon or heterocyclic structures and/or 
10 aromatic or polyaromatic structures substituted with one or more of the above functional 
groups. Candidate agents are also found among biomolecules including peptides, 
saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs or 
combinations thereof. Particularly preferred are peptides. 

15 In one aspect, a modulator will neutralize the effect of a ovarian cancer-associated 
protein. By "neutralize" is meant that activity of a protein is inhibited or blocked and the 
consequent effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be screened 
20 for an ability to bind to a ovarian cancer polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying 
a chemical compound (called a "lead compound") with some desirable property or 
activity, e.g., inhibiting activity, creating variants of the lead compound, and evaluating 
the property and activity of those variant compounds. Often, high throughput screening 
25 (HTS) methods are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
30 assays to Identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual 
therapeutics. 

35 A combinatorial chemical library is a collection of diverse chemical compounds generated 
by either chemical synthesis or biological synthesis by combining a number of chemical 
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"building blocks" such as reagents. For example, a linear combinatorial chemical library, 
such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length 
(i.e., the number of amino acids in a polypeptide compound). Millions of chemical 
5 compounds are synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et at., 1994, J. Med. Chem. 37(9):1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to those of 
skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
10 peptide libraries, peptoids, encoded peptides, random bio-oligomers, nonpeptidal 
peptidomimetics, analogous organic syntheses of small compound libraries, nucleic acid 
libraries, peptide nucleic acid libraries, antibody libraries, carbohydrate libraries and small 
organic molecule libraries. 

15 The assays to identify modulators are amenable to high throughput screening. Preferred 
assays thus detect enhancement or inhibition of ovarian cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

20 High throughput assays for the presence, absence, quantification, or other properties of 
particular nucleic acids or protein products are well known to those of skill in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, e.g., 
U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins, 
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid 

25 binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 

30 Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These 
systems typically automate entire procedures, including all samlsle and reagent pipetting, 
liquid dispensing, timed incubations, and final readings of the microplate in detectors) 
appropriate for the assay. These configurable systems provide high throughput and rapid 
start up as well as a high degree of flexibility and customization. The manufacturers of 

35 such systems provide detailed protocols for various high throughput systems. Thus, e.g., 
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Zymark Corp. provides technical bulletins describing screening systems for detecting the 
modulation of gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
5 fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, 
or random or directed digests of proteinaceous cellular extracts, are used. In this way 
libraries of proteins are made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian 
proteins, with the latter being preferred, and human proteins being especially preferred. 
10 Particularly useful test compound will be directed to the class of proteins to which the 
target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 30 amino 
acids, with from about 5 to about 20 amino acids being preferred, and from about 7 to 

15 about 15 being particularly preferred. The peptides are digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and 
peptide consists of essentially random nucleotides and amino acids, respectively. Since 
generally these random peptides (or nucleic acids, discussed below) are chemically 

20 synthesized, they may incorporate any nucleotide or amino acid at any position, the 
synthetic process are designed to generate randomized proteins or nucleic acids, to 
allow the formation of all or most of the possible combinations over the length of the 
sequence, thus forming a library of randomized candidate bioactive proteinaceous 
agents. 

25 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants at any position. In a preferred embodiment, the library is biased. That is, some 
positions within the sequence are either held constant, or are selected from a limited 
number of possibilities. For example, in a preferred embodiment, the nucleotides or 
30 amino acid residues are randomized within a defined class, e.g., of hydrophobic amino 
acids, hydrophilic residues, sterically biased (either small or large) residues, towards the 
creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
prolines for SH-3 domains, serines, threonines, tyrosines or histidines for 
phosphorylation sites, etc., or to purines, etc. 
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Modulators of ovarian cancer can also be nucleic acids, as defined below. As described 
above generally for proteins, nucleic acid modulating agents are naturally occurring 
nucleic acids, random nucleic acids, or "biased" random nucleic acids. For example, 
digests of procaryotic or eucaryotic genomes are used as is outlined above for proteins. 

5 

In certain embodiments, the activity of a ovarian cancer-associated protein is 
down-regulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a 
nucleic acid complementary to, and which can preferably hybridize specifically to, a 
coding mRNA nucleic acid sequence, e.g., a ovarian cancer-associated protein mRNA, 
10 or a subsequence thereof. Binding of the antisense polynucleotide to the mRNA reduces 
the translation and/or stability of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 

15 subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and 
other sulfur containing species which are known for use in the art. Analogs are 
comprehended by this invention so long as they function effectively to hybridize with the 
ovarian cancer-associated protein mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; 

20 Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, or 
are synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
25 phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense oligonucleotides. Sense 
oligonucleotides can, e.g., be employed to block transcription by binding to the 
anti-sense strand. The antisense and sense oligonucleotide comprise a single-stranded 

30 nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) 
or DNA (antisense) sequences for ovarian cancer molecules. Antisense or sense 
oligonucleotides, according to the present invention, comprise a fragment generally at 
least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to 
derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding 

35 a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988 and van 
der Krol et al. (BioTechniques 6:958 (1988)). 
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In addition to antisense polynucleotides, ribozymes are used to target and inhibit 
transcription of ovarian cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes 
5 have been described, including group I ribozymes, hammerhead ribozymes, hairpin 
ribozymes, RNase P, and axhead ribozymes (see, e.g., Castanotto et al., Adv. in 
Pharmacology 25: 289-317 (1994) for a general review of the properties of different 5 
ribozymes). 

10 Methods of preparing ribozymes are well known to those of skill in the art (see, e.g., WO 
94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et 
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA 
92:699- 703 (1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and 
Yamada et al., Virology 205: 121-126 (1994)). 

15 

Polynucleotide modulators of ovarian cancer are introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, 
as described in WO 91/04753. Suitable ligand binding molecules include, but are not 
limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

20 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule 
does not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense 
oligonucleotide or its conjugated version into the cell. Alternatively, a polynucleotide 
modulator of ovarian cancer are introduced into a cell containing the target nucleic acid 

25 sequence, e.g., by formation of an polynucleotide-lipid complex, as described in WO 
90/10448. It is understood that the use of antisense molecules or knock out and knock in 
models may also be used in screening assays as discussed above, in addition to 
methods of treatment. 

30 As noted above, gene expression monitoring is conveniently used to test candidate 
modulators (e.g., protein, nucleic acid or small molecule). After the candidate agent has 
been added and the cells allowed to incubate for some period of time, the sample 
containing a target sequence to be analyzed is added to the biochip. If required, the 
target sequence is prepared using known techniques. For example, the sample are 

35 treated to lyse the cells, using known lysis buffers, electroporation, etc., with purification 
and/or amplification such as PCR performed as appropriate. For example, an in vitro 
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transcription with labels covalently attached to the nucleotides is performed. Generally, 
the nucleic acids are labeled with biotin-FlTC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 
chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting 
the target sequence's specific binding to a probe. The label also are an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an 
appropriate substrate produces a product that are detected. Alternatively, the label are a 
labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also are a moiety or compound, such as, 
an epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, 
the streptavidin is labeled as described above, thereby, providing a detectable signal for 
the bound target sequence. Unbound labeled streptavidin is typically removed prior to 
analysis. 

As will be appreciated by those in the art, these assays are direct hybridization assays or 
can comprise "sandwich assays", which include the use of multiple probes, as is 
generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,117, 
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 

A variety of hybridization conditions are used in the present invention, including high, 
moderate and low stringency conditions as outlined above. The assays are generally run 
under stringency conditions which allows formation of the label probe hybridization 
complex only in the presence of target. Stringency are controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 
outlined in U.S. Patent No. 5,681,697. Thus it are desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 
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The reactions outlined herein are accomplished in a variety of ways. Components of the 
reaction are added simultaneously, or sequentially, in different orders, with preferred 
embodiments outlined below. In addition, the reaction may include a variety of other 
reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
5 which are used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, efc., may 
also be used as appropriate, depending on the sample preparation methods and purity of 
the target. 

10 

The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression 
profile. 

15 Screens are performed to identify modulators of the ovarian cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
particular expression profile, thus preferably generating the associated phenotype. In 
another embodiment, e.g., for diagnostic applications, having identified differentially 
expressed genes important in a particular state, screens are performed to identify 

20 modulators that alter expression of individual genes. In an another embodiment, 
screening is performed to identify modulators that alter a biological function of the 
expression product of a differentially expressed gene. Again, having identified the 
importance of a gene in a particular state, screens are performed to identify agents that 
bind and/or modulate the biological activity of the gene product. 

25 

In addition screens are done for genes that are induced in response to a candidate 
agent. After identifying a modulator based upon its ability to suppress a ovarian cancer 
expression pattern leading to a normal expression pattern, or to modulate a single 
ovarian cancer gene expression profile so as to mimic the expression of the gene from 

30 normal tissue, a screen as described above are performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated ovarian cancer tissue reveals genes that are not 
expressed in normal tissue or ovarian cancer tissue, but are expressed in agent treated 
• tissue. These agent-specific sequences are identified and used by methods described 

35 herein for ovarian cancer genes or proteins. In particular these sequences and the 
proteins they encode find use in marking or identifying agent treated cells. In addition, 
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antibodies are raised against the agent induced proteins and used to target novel 
therapeutics to the treated ovarian cancer tissue sample. 

Thus, in one embodiment, a test compound is administered to a population of ovarian 
cancer cells, that have an associated ovarian cancer expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the 
cells in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 
encoding a proteinaceous candidate agent (i.e., a peptide) are put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression 
of the peptide agent is accomplished. Regulatable gene administration systems can also 
be used. 

Once the test compound has been administered to the cells, the cells are washed if 
desired and are allowed to incubate under preferably physiological conditions for some 
period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, e.g., ovarian cancer tissue are screened for agents that modulate, e.g., induce or 
suppress the ovarian cancer phenotype. A change in at least one gene, preferably many, 
of the expression profile indicates that the agent has an effect on ovarian cancer activity. 
By defining such a signature for the ovarian cancer phenotype, screens for new drugs 
that alter the phenotype iare devised. With this approach, the drug target need not be 
known and need not be represented in the original expression screening platform, nor 
does the level of transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens are done on individual genes and 
gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of either the expression 
of the gene or the gene product itself are done. The gene products of differentially 
expressed genes are sometimes referred to herein as "ovarian cancer-associated 
proteins" or a "ovarian cancer modulatory protein". The ovarian cancer modulatory 
protein are a fragment, or alternatively, be the full length protein to the fragment encoded 
by the nucleic acids referred to in Tables 1-3. Preferably, the ovarian cancer modulatory 
protein is a fragment. In a preferred embodiment, the ovarian cancer amino acid 
sequence which is used to determine sequence identity or similarity is encoded by a 
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nucleic acid referred to in Tables 1-3. In another embodiment, the sequences are 
naturally occurring allelic variants of a protein encoded by a nucleic acid referred to in 
Tables 1-3. In another embodiment, the sequences are sequence variants as further 
described herein. 

5 

Preferably, the ovarian cancer modulatory protein is a fragment of approximately 14 to 24 
amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-transmembrane region. In a preferred embodiment, the 
fragment has an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of 
10 the fragment is kept as a free acid and the N-terminus is a free amine to aid in coupling, 
i.e., to cysteine. 

In one embodiment the ovarian cancer-associated proteins are conjugated to an 
immunogenic agent as discussed herein. In one embodiment the ovarian cancer- 
15 associated protein is conjugated to BSA. 

Measurements of ovarian cancer polypeptide activity, or of ovarian cancer or the ovarian 
cancer phenotype are performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the ovarian cancer polypeptides are measured 

20 by examining parameters described above. A suitable physiological change that affects 
activity are used to assess the influence of a test compound on the polypeptides of this 
invention. When the functional consequences are determined using intact cells or 
animals, one can also measure a variety of effects such as, in the case of ovarian cancer 
associated with tumours, tumour growth, tumour metastasis, neovascularization, 

25 hormone release, transcriptional changes to both known and uncharacterized genetic 
markers (e.g., northern blots), changes in cell metabolism such as cell growth or pH 
changes, and changes in intracellular second messengers such as cGMP. In tire assays 
of the invention, mammalian ovarian cancer polypeptide is typically used, e.g., mouse, 
preferably human. 

30 

Assays to identify compounds with modulating activity are performed in vitro. For 
example, a ovarian cancer polypeptide is first contacted with a potential modulator and 
incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the ovarian cancer polypeptide levels are determined in vitro by measuring the level of 
35 protein or mRNA. The level of protein is measured using immunoassays such as western 
blotting, ELISA and the like with an antibody that selectively binds to the ovarian cancer 
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polypeptide or a fragment thereof. For measurement of mRNA, amplification, e.g., using 
PCR, LCR, or hybridization assays, e.g., northern hybridization, RNAse protection, dot 
blotting, are preferred. The level of protein or mRNA is detected using directly or 
indirectly labeled detection agents, e.g., fluorescently or radioactively labeled nucleic 
5 acids, radioactively or enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system are devised using the ovarian cancer-associated 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or (beta-gal. The reporter construct is typically transfected into a cell. After 
10 treatment with a potential modulator, the amount of reporter gene transcription, 
translation, or activity is measured according to standard techniques known to those of 
skill in the art. 

In a preferred embodiment, as outlined above, screens are done on individual genes and 
15 gene products (proteins). That is, having identified a particular differentially expressed 
gene as important in a particular state, screening of modulators of the expression of the 
gene or the gene product itself are done. The gene products of differentially expressed 
genes are sometimes referred to herein as "ovarian cancer-associated proteins." The 
ovarian cancer-associated protein are a fragment, or alternatively, be the full length 
20 protein to a fragment shown herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
25 expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are 
identified, variants are further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated gene 
30 product is used; that is, the gene products of one or more differentially expressed nucleic 
acids are made. For example, antibodies are generated to the protein gene products, 
and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the ovarian cancer-associated proteins are used in the 
assays. 

35 
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Thus, in a preferred embodiment, the methods comprise combining a ovarian cancer- 
associated protein and a candidate compound, and determining the binding of the 
compound to the ovarian cancer-associated protein. Preferred embodiments utilize the 
human ovarian cancer-associated protein, although other mammalian proteins may also 
5 be used, e.g. for the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or derivative ovarian cancer-associated 
proteins are used. 

Generally, in a preferred embodiment of the methods herein, the ovarian cancer- 

10 associated protein or the candidate agent is non-diffusably bound to an insoluble support 
having isolated sample receiving areas (e.g. a microtiter plate, an array, etc.). The 
insoluble supports are made of any composition to which the compositions are bound, is 
readily separated from soluble material, and is otherwise compatible with the overall 
method of screening. The surface of such supports are solid or porous and of any 

15 convenient shape. Examples of suitable insoluble supports include microtiter plates, 
arrays, membranes and beads. These are typically made of glass, plastic (e.g., 
polystyrene), polysaccharides, nylon or nitrocellulose, teflon™, etc. microtitre plates and 
arrays are especially convenient because a large number of assays are carried out 
simultaneously, using small amounts of reagents and samples. The particular manner of 

20 binding of the composition is not crucial so long as it is compatible with the reagents and 
overall methods of the invention, maintains the activity of the composition and is 
nondiffusabie. Preferred methods of binding include the use of antibodies (which do not 
sterically block either the ligand binding site or activation sequence when the protein is 
bound to the support), direct binding to "sticky" or ionic supports, chemical crosslinking, 

25 the synthesis of the protein or agent on the surface, etc. Following binding of the protein 
or agent, excess unbound material is removed by washing. The sample receiving areas 
may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

30 In a preferred embodiment, the ovarian cancer-associated protein is bound to the 
support, and a test compound is added to the assay. Alternatively, the candidate agent is 
bound to the support and the ovarian cancer-associated protein is added. Novel binding 
agents include specific antibodies, non-natural binding agents identified in screens of 
chemical libraries, peptide analogs, etc. Of particular interest are screening assays for 

35 agents that have a low toxicity for human cells. A wide variety of assays are used for this 
purpose, including labeled in vitro protein-protein binding assays, electrophoretic mobility 
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shift assays, immunoassays for protein binding, functional assays (phosphorylation 
assays, etc.) and the like. 

The determination of the binding of the test modulating compound to the ovarian cancer- 
5 associate'd protein are done in a number of ways. In a preferred embodiment, the 
compound is labeled, and binding determined directly, e.g., by attaching all or a portion 
of the ovarian cancer-associated protein to a solid support, adding a labeled candidate 
agent (e.g., a fluorescent label), washing off excess reagent, and determining whether 
the label is present on the solid support. Various blocking and washing steps are utilized 
10 as appropriate. 

In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) are labeled. Alternatively, more than one 
component are labeled with different labels, e.g., 125 l for the proteins and a fluorophor for 
15 the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor is a binding moiety known to bind to the target molecule 

20 (i.e., a ovarian cancer-associated protein), such as an antibody, peptide, binding partner, 
ligand, etc. Under certain circumstances, there are competitive binding between the 
compound and the binding moiety, with the binding moiety displacing the compound. In 
one embodiment, the test compound is labeled. Either the compound, or the competitor, 
or both, is added first to the protein for a time sufficient to allow binding, if present. 

25 Incubations are performed at a temperature which facilitates optimal activity, typically 
between 4 and 40°C. Incubation periods are typically optimized, e.g., to facilitate rapid 
high throughput screening. Typically between 0.1 and 1 hour will be sufficient. Excess 
reagent is generally removed or washed away. The second component is then added, 
and the presence or absence of the labeled component is followed, to indicate binding. 

30 

In a preferred embodiment, the competitor is added first, followed by the test compound. 
Displacement of the competitor is an indication that the test compound is binding to the 
ovarian cancer-associated protein and thus is capable of binding to, and potentially 
modulating, the activity of the ovarian cancer-associated protein. In this embodiment, 
35 either component are labeled. Thus, e.g., if the competitor is labeled, the presence of 



WO 2004/022778 PCT/AU2003/001166 

107 

label in the wash solution indicates displacement by the agent. Alternatively, if the test 
compound is labeled, the presence of the label on the support indicates displacement. 

In an alternative preferred embodiment, the test compound is added first, with incubation 
5 and washing, followed by the competitor. The absence of binding by the competitor may 
indicate that the test compound is bound to the ovarian cancer-associated protein with a 
higher affinity. Thus, if the test compound is. labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound 
is capable of binding to the ovarian cancer-associated protein. 

10 

In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the ovarian cancer-associated 
proteins. In this embodiment, the methods comprise combining a ovarian cancer- 
associated protein and a competitor in a first sample. A second sample comprises a test 

15 compound, a ovarian cancer-associated protein, and a competitor. The binding of the 
competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding to the 
ovarian cancer-associated protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 

20 agent is capable of binding to the ovarian cancer-associated protein. 

Alternatively, differential screening is used to identify drug candidates that bind to the 
native ovarian cancer-associated protein, but cannot bind to modified ovarian cancer- 
associated proteins. The structure of the ovarian cancer-associated protein are modeled, 
25 and used in rational drug design to synthesize agents that interact with that site. Drug 
candidates that affect the activity of a ovarian cancer-associated protein are also 
identified by screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

30 Positive controls and negative controls are used in the assays. Preferably control and 
test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and 
the amount of bound, generally labeled agent determined. For example, where a 

35 radiolabel is employed, the samples are counted in a scintillation counter to determine 
the amount of bound compound. 
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A variety of other reagents are included in the screening assays. These include reagents 
like salts, neutral proteins, e.g. albumin, detergents, etc. which are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or background interactions. 
5 Also reagents that otherwise improve the efficiency of the assay, such as protease 
inhibitors, nuclease inhibitors, anti-microbial agents, etc., are used. The mixture of 
components are added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a compound 
10 capable of modulating the activity of a ovarian cancer-associated protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising ovarian cancer- 
associated proteins. Preferred cell types include almost any cell. The cells contain a 
recombinant nucleic acid that encodes a ovarian cancer-associated protein. In a 
preferred embodiment, a library of candidate agents are tested on a plurality of cells. . 

15 

In one aspect, the assays are evaluated in the presence or absence or previous or 
subsequent exposure of physiological signals, e.g. hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In 
20 another example, the determinations are determined at different stages of the cell cycle 
process. 

In this way, compounds that modulate ovarian cancer agents are identified. Compounds 
with pharmacological activity are able to enhance or interfere with the activity of the 
25 ovarian cancer-associated protein. Once identified, similar structures are evaluated to 
identify critical structural feature of the compound. 

In one embodiment, a method of inhibiting ovarian cancer cell division is provided. The 
method comprises administration of a ovarian cancer inhibitor. In another embodiment, a 
30 method of inhibiting ovarian cancer is provided. The method comprises administration of 
a ovarian cancer inhibitor. In a further embodiment, methods of treating cells or 
individuals with ovarian cancer are provided. The method comprises administration of a 
ovarian cancer inhibitor. 

35 In one embodiment, a ovarian cancer inhibitor is an antibody as discussed above. In 
another embodiment, the ovarian cancer inhibitor is an antisense molecule. 
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A variety of cell growth, proliferation, and metastasis assays are known to those of skill in 
the art, as described below. 

5 Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 
transformed, they lose this phenotype and grow detached from the substrate. For 
example, transformed cells can grow in stirred suspension culture or suspended in 
semi-solid media, such as semi-solid or soft agar. The transformed cells, when 

10 transfected with tumour suppressor genes, regenerate normal phenotype and require a 
solid substrate to attach and grow. Soft agar growth or colony formation in suspension 
assays are used to identify modulators of ovarian cancer sequences, which when 
expressed in host cells, inhibit abnormal cellular proliferation and transformation. A 
therapeutic compound would reduce or eliminate the host cells' ability to grow in stirred 

15 suspension culture or suspended in semisolid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are described 
in Freshney, Culture of Animal Cells a Manual of Basic Technique (3rd ed., 1994), herein 
incorporated by reference. See also, the methods section of Garkavtsev et a/. (I996), 
20 supra, herein incorporated by reference. 

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they touch 
other cells. When the cells touch one another, they are contact inhibited and stop 

25 growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow 
to a higher saturation density than normal cells. This are detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 
pattern of normal surrounding cells. Alternatively, labeling index with ( 3 H)-thymicline at 

30 saturation density are used to measure density limitation of growth. See Freshney 
(1994), supra. The transformed cells, when transfected with tumour suppressor genes, 
regenerate a normal phenotype and become contact inhibited and would grow to a lower 
density. 

35 In this assay, labeling index with ( 3 H)-thymidine at saturation density is a preferred 
method of measuring density limitation of growth. Transformed host cells are transfected 
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with a ovarian cancer-associated sequence and are grown for 24 hours at saturation 
density in non-limiting medium conditions. The percentage of cells labeling with 
( 3 H)-thymidine is determined autoradiographically. See, Freshney (1994), supra. 

5 Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal counterparts (see, 
e.g., Temin, J. Natl. Cancer InstL 37:167-175 (1966); Eagle et a/., J. Exp. Med. 
131:836-879 (1970)); Freshney, supra, this is in part due to release of various growth 
factors by the transformed cells. Growth factor or serum dependence of transformed host 

10 cells are compared with that of control. Tumor specific markers levels Tumor cells 
release an increased amount of certain factors (hereinafter "tumour specific markers") 
than their normal counterparts. For example, plasminogen activator (PA) is released from 
human glioma at a higher level than from normal brain cells (see, e.g., Gullino, 
Angiogenesis, tumour vascularization, and potential interference with tumour growth, in 

15 Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, Tumor 
angiogenesis factor (TAF) is released at a higher level in tumour cells than their normal 
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem Cancer Biol. (1992)). 
Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. Also, see, Unkless et at. , J. Biol. Chem. 249:4295-4305 (1974); 

20 Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et ai, Br. J. Cancer 
42:305 312 (1980); Gullino, Angiogenesis, tumour vascularization, and potential 
interference with tumour growth, in Biological Responses in Cancer, pp. 178-184 (Mihich 
(ed.) 1985); Freshney Anticancer Res. 5:111-130 (1985). 

25 Invasiveness into Matrigel 

The degree of invasiveness into Matrigel-or some other extracellular matrix constituent 
are used as an assay to identify compounds that modulate ovarian cancer-associated 
sequences. Tumor cells exhibit a good correlation between malignancy and invasiveness 
of cells into Matrigel or some other extracellular matrix constituent. In this assay, 

30 tumourigenic cells are typically used as host cells. Expression of a tumour suppressor 
gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1994), supra, are used. Briefly, the level of invasion 
of host cells are measured by using filters coated with Matrigel or some other 
35 extracellular matrix constituent. Penetration into the gel, or through to the distal side of 
the filter, is rated as invasiveness, and rated histologically by number of cells and 
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distance moved, or by prelabeling the cells with 125 1 and counting the radioactivity on 
the distal side of the filter or bottom of the dish. See, e.g., Freshney (1984), supra. 

Tumor growth in vivo 

5 Effects of ovarian cancer-associated sequences on cell growth are tested in transgenic 
or immune-suppressed mice. Knock-out transgenic mice are made, in which the ovarian 
cancer gene is disrupted or in which a ovarian cancer gene is inserted. Knock- out 
transgenic mice are made by insertion of a marker gene or other heterologous gene into 
the endogenous ovarian cancer gene site in the mouse genome via homologous 
10 recombination. Such mice can also be made by substituting the endogenous ovarian 
cancer gene with a mutated version of the ovarian cancer gene, or by mutating the 
endogenous ovarian cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells containing 
15 the newly engineered genetic lesion are injected into a host mouse embryo, which is 
re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding 
the chimeric mice it is possible to obtain a new line of mice containing the introduced 
genetic lesion (see, e.g., Capecchi et al, Science 244:1288 (1989)). Chimeric targeted 
20 mice are derived according to Hogan et a/., Manipulating the Mouse Embryo: A 
Laboratory Manual, Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed., IRL Press, Washington, 
D.C., (1987). 

25 Alternatively, various immune-suppressed or immune-deficient host animals are used. 
For example, genetically athymic "nude" mouse (see, e.g., Giovanella et al, J. Natl. 
Cancer Inst 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al, Br. J. Cancer 
41:52 (1980)) are used as a host. Transplantable tumour cells (typically about 10 6 cells) 

30 injected into isogenic hosts will produce invasive tumours in a high proportions of cases, 
while normal cells of similar origin will not. In hosts which developed invasive tumours, 
cells expressing a ovarian cancer-associated sequences are injected subcutaneously. 
After a suitable length of time, preferably 4 to 8 weeks, tumour growth is measured (e.g. 
by volume or by its two largest dimensions) and compared to the control. Tumours that 

35 have a statistically significant reduction (using, e.g. Student's T test) are said to have 
inhibited growth. 
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Administration 

therapeutic reagents of the invention are administered to patients, therapeutically. 
Typically, such proteins/polynucleotides and substances may preferably be combined 
5 with various components to produce compositions of the invention. Preferably the 
compositions are combined with a pharmaceutical^ acceptable carrier or diluent to 
produce a pharmaceutical composition (which are for human or animal use). Suitable 
carriers and diluents include isotonic saline solutions, for example phosphate-buffered 
saline. The composition of the invention are administered by direct injection. The 
10 composition are formulated for parenteral, intramuscular, intravenous, subcutaneous, 
intraocular, oral, vaginal or transdermal administration. Typically, each protein are 
administered at a dose of from 0.01 to 30 mg/kg body weight, preferably from 0.1 to 10 
mg/kg, more preferably from 0.1 to 1 mg/kg body weight. 

15 Polynucleotides/vectors encoding polypeptide components for use in modulating the 
activity of the ovarian cancer-associated proteins/polynucleotides are administered 
directly as a naked nucleic acid construct. When the polynucleotides/vectors are 
administered as a naked nucleic acid, the amount of nucleic acid administered may 
typically be in the range of from 1 fjg to 10 mg, preferably from 100 ^ig to 1 mg. 

20 

Uptake of naked nucleic acid constructs by mammalian cells is enhanced by several 
known transfection techniques for example those including the use of transfection 
agents. Example of these agents Include cationic agents (for example calcium phosphate 
and DEAE-dextran) and lipofectants (for example lipofectam™ and transfectam™). 
25 Typically, nucleic acid constructs are mixed with the transfection agent to produce a 
composition. 

Preferably the polynucleotide or vector of the invention is combined with a 
pharmaceutically acceptable carrier or diluent to produce a pharmaceutical composition. 
30 Suitable carriers and diluents include isotonic saline solutions, for example phosphate- 
buffered saline. The composition are formulated for parenteral, intramuscular, 
intravenous, subcutaneous, oral, intraocular or transdermal administration. 

The pharmaceutical compositions are administered in a range of unit dosage forms 
35 depending on the method of administration. For example, unit dosage forms suitable for 
oral administration include, powder, tablets, pills, capsules and lozenges. Orally 
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administered dosage forms will typically be formulated to protect the active ingredient 
from digestion and may therefore be complexed with appropriate carrier molecules 
and/or packaged in an appropriately resistant carrier. Suitable carrier molecules and 
packaging materials/barrier materials are known in the art. 

The compositions of the invention are administered for therapeutic or prophylatic 
treatments. In therapeutic applications, compositions are administered to a patient 
suffering from a disease (e.g. ovarian cancer) in an amount sufficient to cure or at least 
partially ameliorate the disease and its complications. An amount adequate to 
accomplish this is defined as a "therapeutically effective dose". An amount of the 
composition that is capable of preventing or slowing the development of cancer in a 
patient is referred to as a "prophylactically effective dose". 

The routes of administration and dosages described are intended only as a guide since a 
skilled practitioner will be able to determine readily the optimum route of administration 
and dosage for any particular patient and condition. 

The present invention is further described with reference to the accompanying drawings 
and the following non-limiting examples. 
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EXAMPLE 1 

Gene expression profiling to identify differentially-expressed 
genes in ovarian cancer 

1. Tissue Bank and Database 

5 Tissue was collected from patients undergoing treatment at the GCC, we have established 
an Ovarian Cancer Tissue Bank and Clinical Database that currently holds data on over 
400 cases treated at the GCC between 1986 and 2002. Tissue (currently 149 fresh/frozen 
and 292 archival fixed paraffin-embedded samples) was acquired from patients undergoing 
cytoreductive surgery and does not interfere with the collection of tissue for the normal 

10 processing of diagnostic specimens. Patient consent, included in all our studies, was 
collected prior to surgery. Tissue specimens and their associated pathology reports were 
coded in order to maintain patient confidentiality. Uncoded data was electronically and/or 
physically locked with restricted access by appropriate senior investigators only. Clinical 
(diagnosis, treatment, residual disease) and pathological data (tumour grade, stage) were 

15 collected and updated (disease recurrence, patient survival) at regular intervals. This study 
has ethical approval from the South Eastern Sydney Area Health Service Research Ethics 
Committee, Australia. Clinical data and tissue collection are ongoing. 

2. Genetic profiling of ovarian cancers 

20 In order to identify those genes differentially regulated in epithelial ovarian cancer 51 
ovarian cancer tumor samples were manually dissected from biological samples derived 
from subjects undergoing cytoreductive surgery. These samples comprised 8 
endometrioid tumors, 4 mucinous tumors and 31 serous epithelial ovarian tumors, 12 
corresponding omental deposits and 8 borderline (low-malignant potential) tumors. 

25 

RNA was isolated from the tumor samples in addition to 4 normal ovary samples using 
Trizol reagent (Life Technologies, Rockville, MD, USA) essentially according to 
manufacturer's instructions. RNA was then reverse transcribed using an oligo(dT) 
anchored oligonucleotide that additionally comprised a T7 promoter sequence. Isolated 
30 cDNA was then transcribed in vitro using the T7 MEGAscript kit (Ambion, Austin, TX, 
USA) according to manufacturer's instructions. Transcription was performed with 
biotinylated nucleotides (Bio-11-CTP and Bio-16-UTP) to enable detection of the 
transcribed cRNA. 

35 Levels of gene expression in the cancer samples was then determined by analysing the 
transcribed cDNA samples using customized Affymetrix GeneChip® microarrays that 
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comprise 59,618 oligonucleotide probe sets. These probe sets facilitate analysis of 
46,000 gene clusters, representing over 90% of the predicted expressed human genome. 

Data were normalized, and changes in gene expression detected using a ranked 
5 penalized t-statistic with p-values adjusted for multiple testing using the Holm procedure. 
Analysis was performed using the LIMMA package (available from Bioconductor, 
Biostatistics Unit of the Dana Farber Cancer Institute at the Harvard Medical 
School/Harvard School of Public Health). 

10 Gene expression in 186 samples representing 52 different tissues of the body was also 
determined using the previously described methods to facilitate the identification of 
changes in gene expression that are specific for ovarian cancer. 

Using this method 284 up-regulated transcripts and 186 down-regulated transcripts were 
15 identified. 

In order to determine the efficacy of such a method of analysis for determining gene 
expression changes associated with ovarian cancer, those genes identified were 
compared to results of published expression profile studies. Using this method, 71 
20 genes were identified in the present study that had been previously identified, including, 
for example, genes known to be over-expressed in ovarian cancer, such as, for example 
MUC1 and E-cadherin. 

The ovarian cancer-associated genes and proteins set forth in Table 1 include 
25 sequences that are up-regulated or down-regulated in ovarian cancer subjects, including 
subjects suffering specifically from serous, encodmetrioid, mucinous or clear cell ovarian 
cancer, or non-invasive (borderline) ovarian cancers of any phenotype, and subjects that 
suffered from recurrences of ovarian cancer in the medium term, or died within the 
medium term. 

30 

Data presented in Table 2 indicate those genes that are expressed at significantly higher 
levels or significantly reduced levels in patients suffering from serous cancer relative tot 
he level of expression of the same genes in a normal or healthy subject 
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EXAMPLE 2 

Validation of gene expression profiling results using tissue microarrays 
Each of the transcripts identified as being differentially-expressed specifically in ovarian 
cancer was then further analysed using in situ hybridization or immunohistochemical 
staining of tissue microarrays constructed from a large cohort of primary ovarian tumor 
tissue. Such analysis confirms upregulation, down-regulation or total loss of expression 
of the transcripts identified in the microarray analysis of tumor samples. 

Furthermore, as each of the samples in the tissue microarray have been 
clinicopathologically characterized (for example to identify cancer grade and/or disease 
stage) and the subjects from whom the tumors were isolated continuously monitored (to 
detect for example, death or relapse of cancer), changes with gene expression were also 
analysed for correlation with such parameters in order to determine predictive changes in 
gene expression. 

The relative intensity and percentage of cells staining was determined and evaluated for 
associations with clinical stage and grade of disease and disease relapse usirig the 
Kaplan Meier method and log-rank test, and by univariate and bivariate analyses in a 
Cox proportional hazards model for gene expression and other clinical and pathologic 
predictors of outcome to determine the potential independent prognostic value of the 
markers being assessed. 

Immunohistochemical analysis has been performed on several genes identified in gene 
profiling analysis of ovarian cancer samples. For example, SOX17, Ep-CAM and claudin 
3 were shown by gene profiling analysis to be specifically up-regulated in ovarian cancer 
compared to normal ovaries (Figure 1 and Figure 2). Using immunohistochemical 
analysis, it was determined that SOX17, Ep-CAM and claudin 3 are upregulated in 
serous cancer, mucinous cancer, endometroid cancer and clear cell ovarian cancer. 

Furthermore, immunohistochemical analysis has been used to analyse the expression of 
several other genes that are specifically upregulated in mucinous ovarian cancer. In 
particular the expression of Ll-cadherin (cadherin 17), meprin alpha and Galectin 4 as 
detected using immunohistochemistry is shown in Figure 3. There was a significant 
increase in protein detected in the mucinous ovarian cancer samples compared to the • 
normal ovary sample and serous ovarian cancer sample. 
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Immunohistochemical analysis was also performed to analyse the expression of three 
genes that are known to be upregulated in ovarian cancer (CA125, MUC-1 and E- 
cadherin) (Figures 1 and 2). 

EXAMPLE 3 

Identification of prognostic markers of ovarian cancer 
Using a classical survival analysis to mine expression profiling data several genes that 
are associated with poor patient outcome (ie death or cancer relapse) have been 
identified (Tables 2 and 3). Such genes have clinical utility as prognostic indicators of 
disease. . 

Using detailed clinicopathological and postoperative data on all of the 51 patients 
included in our transcriptional profiling studies, including details of biochemical (eg. rising 
serum CA-125) and/or clinical recurrence of disease and overall survival, expression 
15 profiles were correlates with clinical parameters. 

A preliminary survival analysis was performed on the 33 serous cancers within this 
cohort. The median follow-up time for these patients was 25.5 months from the date of 
primary laparotomy to the date of last follow-up or the date of death, and 21 of these 
20 patients (66%) were deceased from causes related to their malignancy. 

Preliminary analysis of the expression profiles of these tumors identified several potential 
gene clusters that were associated with an increased risk of biochemical and clinical 
recurrence and overall survival, including the EDD gene (SEQ ID NO: 63). Exemplary 
25 prognostic markers for detecting ovarian cancer are shown in Tables 1 and 3. Preferred 
markers are indicated in Table 3. 

Using immunohistochemical analysis two genes have been confirmed to be upregulated . 
in serous ovarian cancer. In particular, sFRP4, a negative signalling protein of the Wnt 
30 pathway, and SOCS3, a negative signaller of IL-6 induced signalling are specifically 
upregulated in serous ovarian cancer when compared to normal ovarian tissue (Figure 
4A). 

Furthermore, using clinical patient data and correlating this information with gene 
35 expression levels using a Cox proportional hazards model, it has been shown that high 
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expression of sFRP4 correlates with a poor outcome in patients (n=127) with serous 
ovarian cancer (p=0.0056) (Figure 4B). 

EXAMPLE 4 

Validation of gene expression profiling results using quantitative RT-PCR 
Candidate diagnostic genes are screened by quantitative RT-PCR against ovarian 
cancer cell lines to both validate the transcript profiling data (ie check their up- or down- 
regulation). Candidate diagnostic genes are screened using mRNA isolated from a panel 
of 9 ovarian tumour cell lines, (A2780, SKOV3, OVCAR-3, IGROV-1, CAOV3, OV-90, 
SW626, TOV-21G and TOV-112D), in addition to several other tumour cell lines including 
lines derived from breast, prostate and colorectal tumours, and immortalised (non- 
transformed) human ovarian surface epithelial cells and a primary normal breast 
epithelial cell line (184). 

Total RNA is isolated from the normal and tumour cell lines, reverse transcribed into 
cDNA and used as template in a quantitative PCR using a LightCycler system (Roche 
Diagnostics). The relative amount of each gene product is determined by comparison to 
a standard housekeeping gene (GAPDH). 

EXAMPLES 

Identification of Novel Genes for Diagnosis of Ovarian Cancer 
We identified candidate genes with diagnostic potential from our list of aberrantly 
regulated genes by applying the following selection procedure: genes with a good 
transcript profile and low p-value (ie highly significantly up- or down-regulated in ovarian 
cancer, as determined in Example 1); and mapping to areas of the genome that have 
been shown to be amplified or lost in ovarian cancer. Accordingly, it is likely that these 
genes are involved in the development and progression of ovarian cancer (ie putative 
oncogenes and tumour suppressor genes). Additional parameters for analysis included 
known or putative function in oncogenesis (eg signal transduction, regulation of cellular 
proliferation, apoptosis etc); and association with other forms of other tumours. Genes 
identified in this analysis are shown in Table 3. 

One method for the diagnosis of cancer comprises detecting modified DNA shed by the 
developing tumour into the blood stream. This can include the detection of mutations in 
both oncogenes and tumour suppressor genes involved in the development and 
progression of ovarian cancer. Furthermore, it has been recently shown that aberrant 
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methylation of tumour suppressor genes, specifically hypermethylation of their gene 
promoters, frequently accompanies gene silencing in cancers, and indeed in some cases 
appears to be the predominant mechanism of gene silencing. 

5 Combined with the knowledge of tumour nucleic acids circulating in the blood that reflect 
the biological characteristics of a tumour, the detection of methylation-specific tumour 
suppressor gene signatures for any given tumour type has promise as a specific and , 
sensitive molecular test for detecting and monitoring cancer. Aberrant methylation is a 
frequent epigenetic event in epithelial ovarian cancer and many candidate tumour 
10 suppressor genes of epithelial ovarian cancer have been shown to be hypermethylated in 
epithelial ovarian cancer, such as, for example BRCA1. 

In particular, expression of the candidate tumor suppressor gene MCC, has been shown 
to be down-regulated in epithelial ovarian cancer compared to normal ovarian tissue. 

15 MCC appears to be involved in critical cell growth regulatory processes and maps to a 
chromosomal region hypothesised as containing a tumor suppressor gene in ovarian 
cancer. Furthermore, we have identified a CpG island within the predicted promoter 
sequence of the MCC gene, a critical feature of genes that are subject to gene silencing 
by hypermethylation and a known characteristic of tumor suppressor genes. Taken 

20 together these data strongly implicate MCC as a candidate tumor suppressor gene 
involved in epithelial ovarian cancer. 
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Table 4 



Correlation of expression between normal ovarian surface epithelium (OSE), non- 





CA125 


MUC-1 


E- 
cadherin 


CLDN3 


Ep-CAM 


SOX17 


OSE vs IC 


<0.0001 


<0.0001 


0.7251 


0.6132 


0.1573 


0.0854 


OSE vs. BL 


0.1765 


<0.0001 


0.0307 


0.3633 


0.0005 


0.2287 


OSE vs. CA 


0.5443 


<0.0001 


0.1687 


0.0008 


<0.0001 


0.6900 . 


ICvs. BL 


<0.0001 


O.0001 . 


0.1116 


0.7849 


0.0913 


0.2530 


ICvs. CA 


<o:oooi 


0.2707 


0.4147 


0.0071 


0.0002 


0.0544 


BL vs. CA 


0.0001 


O.0001 


0.0615 


<0.0001 


0.0011 


0.0152 



Table 5 

Correlation of gene expression with patient outcome (univariate 
analysis ie., expression alone without the influence of covariates) 

Univariate analysis for clinicopathological variables and CLDN3, Ep-CAM, SOX17, CA125, MUC1 and E-cadherin 
immunoreactivity with survival and relapse in 156 patients with epithelial ovarian cancer 



Disease Specific Survival Relapse Free Survival 

Univariate Univariate 



Variable 


Hazards ratio (95% CI) 


p-value 


Hazards ratio (95% CI) 


p-value 


Pathological tumor stage 










Stage 1 -3b vs. 3c -4b 


5.89 (3.214-10.79) 


<0.0001 


7.37 (3.26-16.63) 


<(M)001 


Tumor grade 










BL and G1 vs. G2 and G3 


5.508 (2.745-11.052) 


<0.0001 


7.02 (2.76-17.82) 


<0.0001 


Age 










<50 vs. >=50 


0.533 (0.288-0.988) 


0.0458 


0.62 (0.29-1.33) 


0.2221 


Residual Disease 










RD<1cmvs. >=1cm 


4.192 (2.671-6.580) 


<0.0001 


4.17(2.30-7.55) 


<0.0001 


CA125 level at diagnosis 










CA125<500vs. >500U/ml 


1.843 (1.102-3.080) 


0.0197 


2.292 (1.19-4.40) 


0.0128 


Performance Status 










PS<1vs, >1 


0.270 (0.133-0.549) 


0.0003 


0.53 (0.16-1.74) 


0.2965 


CLDN3 expression 










Membranous Score Ovs. >0 


2.794(1.012-7.718) 


0.0474 


2.521 (0.908-6.998) 


0.0758 


Membranous Score <1 vs. >1 


1.309 (0.763-2.246) 


0.3285 


1.952 (1.103-3.457) 


0.0217 


Ep-CAM expression 










Membranous Score <1vs. >1 


1.460 (0.809-2.634) 


0.2093 


2.041 (0.997-4.177) 


0.0509 


Membranous Score <2v$. >2 


1.041 (0.634-1.711) 


0.873 


1.449 (0.845-2.487) 


0.1779 


SOX1 7 expression 










Nuclear membranous Score Ovs. >0 


0.839 (0.514-1.368) 


0.481 


1.311 (0.72B-2.358) 


0.3667 


Nuclear membranous Score <1vs. >1 


1.407(0.615-3.218) 


0.4183 


1.037 (0.380-2.829) 


0.9437 


CA125 expression 










Membranous apical Score Ovs. >0 


2.581 (1.393-4.781) 


0.0026 


2.725(1.218-6.093) 


0.0146 


Membranous apical Score <1vs. >1 


1.637 (1.045-2.564) 


0.0313 


1.298 (0.731-2.307) 


0.3737 


MUC1 expression 










Membranous apical Score Ovs. >0 


2.479(0.343-17.898) 


0.368 


NA 




Membranous apical Score <1 vs. >1 


3.745 (1.176-11.926) 


0.0254 


6.432 (1.562-26.483) 


0.0099 


Membranous apical Score <2vs. >2 


1.814(0.898-3.664) 


0.0969 


3.893 (1.552-9.766) 


0.0038 


E-cadherin expression 










Membranous Score Ovs. >0 


0.806(0.493-1.318) 


0.3892 


0.837 (0.477-1.467) 


0.5341 


Membranous Score <1vs. >1 


1.331 (0.532-3.333) 


0.5411 


0.847 (0.263-2.731) 


0.7814 


Membranous Score <2vs. >2 


0.593 (0.082-4.284) 


0.6041 


0.913(0.125-6.646) 


0.9284 
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Table 6 

Correlation of gene expression with patient outcome (multivariate 
analysis ie looking at expression incorporating the influence of covariates) 



Multivariate analysis for univariate significant clinicopathologlcal variables and CLDN3, Ep-CAM. SOX17, CA125. MUC1 
and E-cadherin immunoreactivity with survival and relapse in 156 patients with epithelial ovarian cancer 



Disease Specific Survival _ 



Relapse Free Survival 



Variable 



Multivariate 
Hazards ratio (95% CI) 



Univariate 
p-varue Hazards ratio (95% CI) 



p-va!oe 



Pathological tumor stage 

Stage 1- 3b vs. 3c -4b 
Tumor grade 

BL and G1 vs. 62 and G3 

Age 

<50 vs . >=50 
Residual Disease 

RD<1cm vsl >=1cm 
CA125 level at diagnosis < 

CA125 <500 vs. >500U/ml 
Performance Status 

PS<1 vs. >1 

CLDN3 expression 

Membranous Score 0 vs. >0 
Membranous Score <1 vs. >1 

CA125 expression 

Membranous apical Score 0 vs. >0 
Membranous apical Score <1 vs. >1 

MUC1 expression 

Membranous apical Score 0 vs. >0 
Membranous apical Score <1 vs. >1 
Membranous apical Score <2 vs. >2 



5.66 (2.467-13.012) 
4.919(2.080-11.633) 
0.951 (0.482-1.877) 
2.974(1.783-4.959) 
1.148(0.625-2.109) 
0.286 (0.136-0.601) 

1.165 (0.325-4.183) 



0.917(0.415-2.025) 
1.664 (0.976-2.837) 



0.678(0.255-1.804) 



<0.0001 
0.0003 
0.8853 

<0.0001 
0.6563 
0.0009 

0.8145 



0.8302 
0.0612 



0.4361 



5.192(1.860-14.496) 
7.989(2.385-26.760) 

2.779 (1.433*5.393) 
1289 (0.659-2.520) 



0.953 (0.473-1.919) 
0.693(0.271-1.768) 



04)017 
0.0008 



0.0025 

0.4587 



0.8918 
0.4427 
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WE CLAIM: 

1. A method of detecting an ovarian cancer-associated transcript in a biological 
sample, the method comprising contacting the biological sample with a polynucleotide 

5 that selectively hybridizes to a sequence at least 80% identical to a sequence as shown 
in Table 1 or 2 or 3. 

2. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 

10 tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian cancer, and wherein said nucleic acid probe comprises a 
15 sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 
20 (ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 
25 (iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 35, 37, 39, 41, 43, 45, 46, 48, 50, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71,73, 75,77,79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
30 consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 

34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82 and 84; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

35 
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3. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 

5 hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
10 selected from the group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 

23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 
67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 

15 group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1, 5, 7, 9, 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 

20 33, 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 

77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 
36, 38, 40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 

25 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

4. The method of claim 2 or 3 wherein the hybridization is enhanced in the sample 
from the subject being tested compared to the hybridization obtained for a sample from a 

30 control subject not having ovarian cancer. ' 

5. The method of claim 2 or 3 wherein the hybridization is reduced in the sample 
from the subject being tested compared to the hybridization obtained for a sample from a 
control subject not having ovarian cancer. 
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6. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an enhanced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 or 2 other than a nucleic acid having an Accession 
Number selected from the group consisting of NM_022117, NM_005460, 
NM_002387, AI745249 arid AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NM_022117, NM_005460, NMJ302387, AI745249 and 
AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 or 2 other than a nucleic acid having an Accession Number selected from 
the group consisting of NM_022117, NM_005460, NM_002387, AI745249 and 
AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

7. The method of claim 6 wherein the nucleic acid probe comprises a sequence 
selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 7, 9, 11, 13, 15, 17, 21, 23, 
25, 27, 29, 31, 33. 35, 37, 39, 41, 45, 46, 48, 52, 53, 55, 57, 59, 61, 63, 65, 67, 
69, 71, 73, 75, 77, 79, 81 and 83; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9, 11, 13. 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37. 39, 41, 45, 46, 48, 52. 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73. 75, 77. 
79, 81 and 83; 
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(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 7, 9. 11, 13, 15, 17, 21, 23, 25, 27, 29, 31, 33, 
35, 37, 39, 41, 45, 46, 48. 52, 53, 55, 57, 59. 61, 63, 65, 67, 69, 71, 73, 75, 77, 
79, 81 and 83; 

5 (iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; 
and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
10 (ii) or (iii) or (iv). 

8. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 
tested with a nucleic acid probe for a time and under conditions sufficient for 

15 hybridization to occur and then detecting the hybridization wherein a reduced level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has an ovarian ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

20 (i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of NM_0221 17, NM_005460, NM_002387, AI745249 and AI694200; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 

25 and having an Accession Number selected from the group consisting of 

NM_022117, NM_005460, NM_002387, AI745249 and AI694200; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of 

30 NM_0221 1 7, NM_005460, NM_002387, AI745249 and AI694200; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 



35 



9. The method of claim 8 wherein the nucleic acid probe comprises a sequence 
selected from the group consisting of: 
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(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
selected from the group consisting of SEQ ID NOs: 1, 3, and 5; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 

5 group consisting of SEQ ID NOs: 1 , 3, and 5; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
group consisting of SEQ ID NOs: 1 , 3, and 5; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6; and 

10 (v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

10. The method according to any one of claims 1 to 9 wherein the ovarian cancer that 
is diagnosed is an epithelial ovarian cancer. 

15 

1 1 . The method according to any one of claims 1 to 9 wherein the ovarian cancer that 
is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 

20 Brenner cell and undifferentiated adenocarcinoma. 

12. The method according to claim 1 1 wherein the ovarian cancer that is diagnosed is 
selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer and clear cell ovarian cancer. 

25 

13. A method of diagnosing a serous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein a modified level of 

30 hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a serous ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
35 acid set forth in Table 2 or as set forth in Table 1 and having an Accession 
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Number selected from the group consisting of: U62801, D49441, X51630, And 
AB018305; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 2 
or as set forth in Table 1 and having an Accession Number selected from the 
group consisting of: U62801, D49441; X51630, And AB018305; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 2 or as set forth in Table 1 and having an Accession Number selected from 
the group consisting of: U62801, D49441, X51630, And AB018305; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

14. A method of diagnosing a mucinous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a mucinous ovarian cancer, and wherein said nucleic acid probe comprises a 
sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: NMJJ06149, AA315933, U47732, NMJJ05588, AW503395, 
NMJJ04Q63, AI073913, AI928445, NMJ)22454, W40460, AA1 32961 and 
AF111856; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: 
NMJJ06149, AA315933, U47732, NMJJ05588, AW503395, NMJ)04063, 
AI073913, AI928445, NMJ)22454, W40460, AA1 32961 and AF111856; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting of: 
NMJD06149, AA315933, U47732, NMJJ05588, AW503395, NMJJ04063, 
AI073913, AI928445, NMJJ22454, W40460, AA1 32961 and AF1 11856; and 
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(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15. The method of claim 14 wherein the nucleic acid probe comprises a sequence 
5 selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from SEQ ID NO: 
57 or 59 or 61; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from SEQ ID NO: 57 or 59 or 61; 

10 (iii) a sequence that is at least about 80% identical to SEQ ID NO: 57 or 59 or 61 ; 

(iv) a sequence that encodes the amino acid sequence set forth in SEQ ID NO: 58 or 
60 or 62; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

15 

16. The method according to any one of claims 1 to 15 comprising performing a PCR 
reaction. 

17. The method according to any one of claims 1 to 16 comprising performing a 
20 nucleic acid hybridization. 

18. A method of detecting an ovarian cancer-associated polypeptide in a biological 
sample the method comprising contacting the biological sample with an antibody that 
binds specifically to an ovarian cancer-associated polypeptide in the biological sample, 

25 the polypeptide being encoded by a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1-3. 

19. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising contacting a biological sample from said subject being 

30 tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a modified level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
the subject being tested has an ovarian cancer, and wherein said antibody binds to a 

35 polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
amino acid residues of a sequence having at least about 80% identity to a sequence 
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selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68,. 70, 
72, 74, 76, 78, 80, 82 and 84. 

5 20. The method of claim 19 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 
a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 6, 8, 10, 12, 14, 16, 18, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 47, 49, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

10 

21 . A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said, method comprising contacting a biological sample from said subject being 
tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein an enhanced level of the 

15 antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a. control subject not having ovarian cancer 
indicates that the subject being tested has an ovarian cancer, and wherein said antibody 
binds to a polypeptide comprising an amino acid sequence comprising at least about 10 
contiguous amino acid residues of a polypeptide encoded by a nucleic acid set forth in 

20 Table 1 or 2 other than a nucleic acid having an Accession Number selected from the 
group consisting of NMJ)221 17, NMJ)05460, NM_002387, AI745249 and AI694200. 

22. The. method of claim 21 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 

25 a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 
40, 42, 44, 47, 49, 51, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

23. A method of diagnosing an ovarian cancer in a human or animal subject being 
30 tested said method comprising contacting a biological sample from said subject being 

tested with an antibody for a time and under conditions sufficient for an antigen-antibody 
complex to form and then detecting the complex wherein a reduced level of the antigen- 
antibody complex for the subject being tested compared to the amount of the antigen- 
antibody complex formed for a control subject not having ovarian cancer indicates that 
35 the subject being tested has an ovarian cancer, and wherein said antibody binds to a 
polypeptide comprising an amino acid sequence comprising at least about 10 contiguous 
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amino acid residues of a polypeptide encoded by a nucleic acid set forth in Table 1 and 
having an Accession Number selected from the group consisting of NM_022117, 
NM J)05460, NMJ)02387, AI745249 and AI694200. 

24. The method of claim 23 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 
a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 2, 4, and 6. 

25. The method according to any one of claims 19 to 24 wherein the ovarian cancer 
that is diagnosed is an epithelial ovarian cancer. 

26. The method according to any one of claims 19 to 24 wherein the ovarian cancer 
that is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell and undifferentiated adenocarcinoma. 

27. The method according to claim 26 wherein the ovarian cancer that is diagnosed is 
selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 
endometrioid ovarian cancer and clear cell ovarian cancer. 

28. A method of diagnosing a serous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
being tested with an antibody for a. time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein a modified level of the. 
antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has a serous ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 2 or as set forth in Table 1 and having an Accession Number selected from 
the group consisting of: U62801, D49441, X51630, And AB018305. 

29. A method of diagnosing a mucinous ovarian cancer in a human or animal subject 
being tested said method comprising contacting a biological sample from said subject 
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being tested with an antibody for a time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein a reduced level of the 
antigen-antibody complex for the subject being tested compared to the amount of the 
antigen-antibody complex formed for a control subject not having ovarian cancer 
5 indicates that the subject being tested has a mucinous ovarian cancer, and wherein said 
antibody binds to a polypeptide comprising an amino acid sequence comprising at least 
about 10 contiguous amino acid residues of a polypeptide encoded by a nucleic acid set 
forth in Table 1 and having an Accession Number selected from the group consisting of: 
NM_006149, AA315933, U47732, NM_005588, AW503395, NMJJ04063, AI073913, 
10 AI928445, NMJJ22454, W40460, AA132961 and AF1 1 1856. 

30. The method according to claim 29 wherein the antibody binds to a polypeptide 
comprising an amino acid sequence comprising at least about 10 contiguous amino acid 
residues of a sequence having at least about 80% identity to SEQ ID NO: 58 or 60 or 62. 

15 

31. A method of detecting an ovarian cancer-associated antibody in a biological 
sample the method comprising contacting the biological sample with a polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% 
identical to a sequence as shown in Tables 1-3, wherein the polypeptide specifically 

20 binds to the ovarian cancer-associated antibody. 

32. The method according to any one of claims 1 to 31 wherein the biological sample 
is contacted with a plurality of nucleic acid probes and/or antibodies and/or polypeptides. 

25 33. The method according to any one of claims 1 to 32 wherein the subject being 
tested is a patient undergoing a therapeutic regimen to treat ovarian cancer. 

34. The method according to any one of claims 1 to 32 wherein the subject being 
tested is a subject suspected of having ovarian cancer. 

30 

35. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer, 
the method comprising: 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

35 (ii) determining the level of a ovarian cancer-associated transcript in the 

biological sample by contacting the biological sample with a polynucleotide 
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that selectively hybridizes to a sequence having at least about 80% identity 
to a sequence as shown, in any one of Tables 1-3, thereby monitoring the 
efficacy of the therapy. 

5 36. The method according to claim 35 further comprising comparing the level of the 
ovarian cancer-associated transcript to a level of the ovarian cancer-associated transcript 
in a biological sample from the patient prior to, or earlier in, the therapeutic treatment 

37. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer, 
10 the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 

treatment; and 

(ii) determining the level of a ovarian cancer-associated antibody in the 
biological sample by contacting the biological sample with a polypeptide 

15 encoded by a polynucleotide that selectively hybridizes to a sequence at 

least 80% identical to a sequence as shown in Tables 1-3, wherein the 
polypeptide specifically binds to the ovarian cancer-associated antibody, 
thereby monitoring the efficacy of the therapy. 

20 38. The method of claim 37 further comprising comparing the level of the ovarian 
cancer-associated antibody to a level of the ovarian cancer-associated antibody in a 
biological sample from the patient prior to, or earlier in, the therapeutic treatment. 

39. A method of monitoring the efficacy of a therapeutic treatment of ovarian cancer , 
25 the method comprising : 

(i) providing a biological sample from a patient undergoing the therapeutic 
treatment; and 

(ii) determining the level of a ovarian cancer-associated polypeptide in the 
biological sample by contacting the biological sample with an antibody, wherein 

30 the antibody specifically binds to a polypeptide encoded by a polynucleotide that 

selectively hybridizes to a sequence at least 80% identical to a sequence as 
shown in Tables 1-3, thereby monitoring the efficacy of the therapy. 
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40. The method of claim 39 further comprising comparing the level of the ovarian 
cancer-associated polypeptide to a level of the ovarian cancer-associated polypeptide in 
a biological sample from the patient prior to, or earlier in, the therapeutic treatment 

5 41 . The method according to any one of claims 35 to 40 wherein the ovarian cancer 
that is diagnosed is an epithelial ovarian cancer. 

42. The method according to any one of claims 35 to 41 wherein the ovarian cancer 
that is diagnosed is selected from the group consisting of serous ovarian cancer, non- 
10 invasive ovarian cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, 

endometrioid ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, 
Brenner cell and undifferentiated adenocarcinoma. 

43. The method according to claim 42 wherein the ovarian cancer that is diagnosed is 
15 selected from the group consisting of serous ovarian cancer, mucinous ovarian cancer, 

endometrioid ovarian cancer and clear cell ovarian cancer. 

44. A method of determining the likelihood of survival of a subject suffering from an 
ovarian cancer, said method comprising contacting a biological sample from said subject 

20 being tested with a nucleic acid probe for a time and under conditions sufficient for 
hybridization to occur and then detecting the hybridization wherein an elevated level of 
hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a poor probability of survival, and wherein said nucleic acid probe comprises a 
25 sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: NMJ)03014, AA046217, NMJ)15902, T83882, AB040888, 
AA628980, AI623351, AW6T4420, AA243499, AF251237, AI970797, AF145713, 
30 X78565, T97307, BE243845, AW068302, AL1 33561, BE313555, X07820, 

AI973016, AF084545, U41518, Z11894, AW138190, BE086548, W47196, 
AI796870, X02761, AW968613, AW972565, AF045229, AW953853, U52426, 
F06700, AI798863, H52761, BE546947, AU076643, U20536, AA581602, 
AJ245210, X65965, AI806770, BE386490, AW581992, U77534, AL034417, 
35 L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 

BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NMJ)14992, 
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BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
BE614567, AL079658, NM_002776, BE261944, NM_006379, AI002238, X81789, 
NM_002122, AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
N72403, BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, 
5 NM_001 955, AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: 
NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 

10 AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307. 

BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 
AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863, 
H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 

15 AI806770, BE386490, AW581992, U77534, AL034417. L10343, AW518944, 

W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 
M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 
R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 
NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 

20 AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 

BE003054, AL035588, AI080491, AW770994, H24177, AF146761, NM_001955, 
AI680737, AI752666, AA505445, BE246649, and NM_003955; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
25 Table 1 and having an Accession Number selected from the group consisting of: 

NM_003014, AA046217, NM_015902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, 
BE243845, AW068302, AL133561, BE313555, X07820, AI973016, AF084545, 
U41518, Z11894, AW138190, BE086548, W47196, AI796870, X02761, 

30 AW968613, AW972565, AF045229, AW953853, U52426, F06700, AI798863, 

H52761, BE546947, AU076643, U20536, AA581602, AJ245210, X65965, 
AI806770, BE386490, AW581992, U77534, AL034417, L10343, AW518944, 
W28729, AI640160, U11862, AW295980, X59135, BE466173, AI354722, 
M90464, AA829286, AI333771, BE465867, NM_014992, BE616902, AA430373, 

35 R27430, BE387335, AW264102, AW952323, AA088177, BE614567, AL079658, 

NM_002776, BE261944, NM_006379, AI002238, X81789, NM_002122, 
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AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, N72403, 
BE003054, AL035588, AI080491, AW770994, H24177, AF146761, NMJKJ1955, 
AI680737, AI752666, AA505445, BE246649, and NM_003955; and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
5 (ii) or (iii) or (iv). 

45. The method of claim 44 wherein the nucleic acid probe comprises a sequence 
selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a sequence 
10 selected from the group consisting of SEQ ID NOs: 63, 65, 67, 69, 71 , and 73; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a sequence selected from the 
group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iii) a sequence that is at least about 80% identical to a sequence selected from the 
15 group consisting of SEQ ID NOs: 63, 65, 67, 69, 71, 73, 75, 77, 79, 81 and 83; 

(iv) a sequence that encodes an amino acid sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84; and 

(v) a sequence that is complementary to (i) or (ii) or (iii) or (iv). 

20 46. A method of determining the likelihood of survival of a subject suffering from an 
ovarian cancer, said method comprising contacting a biological sample from said subject 
being tested with an antibody for a time and under conditions sufficient for an antigen- 
antibody complex to form and then detecting the complex wherein an enhanced level of 
the antigen-antibody complex for the subject being tested compared to the amount of the 

25 antigen-antibody complex formed for a control subject not having ovarian cancer 
indicates that the subject being tested has has a poor probability of survival, and wherein 
said antibody binds to a polypeptide comprising an amino acid sequence comprising at 
least about 10 contiguous amino acid residues of a sequence encoded by a nucleic acid 
set forth in Table 1 and having an Accession Number selected from the group consisting 

30 of: NMJ)03014 ( AA046217, NMJ315902, T83882, AB040888, AA628980, AI623351, 
AW614420, AA243499, AF251237, AI970797, AF145713, X78565, T97307, BE243845, 
AW068302, AL1 33561, BE313555, X0782Q, AI973016, AF084545, U41518, Z11894, 
AW138190, BE086548, W47196, AI796870, X02761, AW968613, AW972565, 
AF045229, AW953853, U52426, F06700, AI798863, H52761, BE546947, AU076643, 

35 U20536, AA581602, AJ245210, X65965, AI806770, BE386490, AW581992, U77534, 
AL034417, L10343, AW518944, W28729, AI640160, U11862, AW295980, X59135, 



WO 2004/022778 PCT/AU2003/001 166 

195 

BE466173, AI354722, M90464, AA829286, AI333771, BE465867, NM_014992, 
BE616902, AA430373, R27430, BE387335, AW264102, AW952323, AA088177, 
BE614567, AL079658, NMJ502776, BE261944, NMJ)06379, AI002238, X81789, 
NMJ)02122 t AB001914, AA311919, AI381750, AA292998, BE439580, AI677897, 
5 N72403, BE003054, AL035588, AI080491, AW770994, H24177, AF1 46761, 
NM_001955, AI680737, AI752666, AA505445, BE246649, and NM_003955. 

47. The method of claim 46 wherein the antibody binds to a polypeptide comprising 
an amino acid sequence comprising at least about 10 contiguous amino acid residues of 

10 a sequence having at least about 80% identity to a sequence selected from the group 
consisting of SEQ ID NOs: 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 and 84. 

48. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 

15 subject being tested with a nucleic acid probe for a time and under conditions sufficient 
for hybridization to occur and then detecting the hybridization wherein an elevated level 
of hybridization of the probe for the subject being tested compared to the hybridization 
obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a poor probability of survival, and wherein said nucleic acid probe comprises a 

20 sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid comprising the nucleotide sequence set forth in SEQ ID NO: 71 or 73; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid comprising the 

25 nucleotide sequence set forth in SEQ ID NO: 71 or 73; 

(iii) a sequence that is at least about 80% identical to (i) or (ii) and encoding an sFRP 
protein or a SOCS3 protein; 

(iv) a sequence that encodes a polypeptide comprising the amino acid sequence set 
forth in SEQ ID NO: 72 or 74; and 

30 (v) a sequence that is complementary, to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

49. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 

35 subject being tested with an antibody for a time and under conditions sufficient for an 
antigen-antibody complex to form and then detecting the complex wherein an enhanced 
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level of the antigen-antibody complex for the subject being tested compared to the 
amount of the antigen-antibody complex formed for a control subject not having ovarian 
cancer indicates that the subject being tested has a poor probability of survival, and 
wherein said antibody binds to an sFRP polypeptide comprising the amino acid 
5 sequence set forth in SEQ ID NO: 72 or a SOCS3 polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO: 74. 

50. A method of determining the likelihood of survival of a subject suffering from a 
serous ovarian cancer, said method comprising contacting a biological sample from said 

10 subject being tested with at least two antibodies for a time and under conditions sufficient 
for antigen-antibody complexes to form and then detecting the complexes wherein an 
enhanced level of the antigen-antibody complexes for the subject being tested compared 
to the amount of the antigen-antibody complexes formed for a control subject not having 
ovarian cancer indicates that the subject being tested has a poor probability of survival, 

15 and wherein one antibody binds to an sFRP polypeptide comprising the amino acid 
sequence set forth in SEQ ID NO: 72 and wherein one antibody binds to a SOCS3 
polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 74. 

51. The method according to any one of claims 44 to 47 wherein the ovarian cancer 
20 is an epithelial ovarian cancer. 

52. The method according to any one of claims 44 to 47 wherein the ovarian cancer is 
selected from the group consisting of serous ovarian cancer, non-invasive ovarian 
cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, endometrioid 

25 ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, Brenner cell 
and undifferentiated adenocarcinoma. 

53. The method according to claim 52 wherein the ovarian cancer is selected from the 
group consisting of serous ovarian cancer, mucinous ovarian cancer, endometrioid 

30 ovarian cancer and clear cell ovarian cancer. 

54. A method of determining the likelihood that a subject will suffer from a recurrence 
of an ovarian cancer, said method comprising contacting a biological sample from said 
subject being tested with a nucleic acid probe for a time and under conditions sufficient 

35 for hybridization to occur and then detecting the hybridization wherein an elevated level 
of hybridization of the probe for the subject being tested compared to the hybridization 
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obtained for a control subject not having ovarian cancer indicates that the subject being 
tested has a high probability of recurrence, and wherein said nucleic acid probe 
comprises a sequence selected from the group consisting of: 

(i) a sequence comprising at least about 20 contiguous nucleotides from a nucleic 
acid set forth in Table 1 and having an Accession Number selected from the group 
consisting of: M86849, AW963419, BE298665, AK000637, BE077546, T97307, 
R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393, H65423. N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 
AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 
AA1 88763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 
AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740, 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA130986, AA216363, AA628980, AA811657, ' 
AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 
AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, 
AW452948, AW470411, AW885727, AW970859, AW979189, BE1 65866, 
BE1 75582, BE242587, BE271927. BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 
X76732 andZ45051, 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to at least about 20 contiguous nucleotides from a nucleic acid set forth in Table 1 
and having an Accession Number selected from the group consisting of: M86849, 
AW963419, BE298665, AK000637, BE077546, T97307, R24601, BE090176, 
AA393907, W28729, BE313754, AW673081, AA356694, L08239, BE397649, 
NM_012317, NM_000947, AJ250562, AL040183, BE207573, BE564162, 
BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, NM_002514, 
AI863735, NM_005397, W26391, H 15474, U51166, AA243499, AW408807, 
AI738719, AB040888, BE313077, AI677897, C14898, AI821730, AF007393, 
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H65423, N46243, AA095971, U20350, NM_005756, D19589, AW957446, 
AW294647, BE159718, AI888490, AA022569, BE147740, AI798863, BE464341, 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 
AU076611, AW968613, AL353944, BE614149, AA292998, H12912, AA188763, 
5 AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, AK001564, 

AW959861, BE313555, W25005, AI193356, AF111106, AI130740. AA985190, 
BE221880, AF084545. R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL133572, AI497778, AI745379, U51712, AW375974, AF251237, 
NM_000636, AA130986, AA216363, AA628980, AA811657, AA897108, 

10 AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, AI827248, 

AK002039, AL1 09791, AW090198, AW296454, AW445034, AW452948, 
AW470411, AW885727, AW970859, AW979189, BE165866, BE175582, 
BE242587, BE271927, BE439580, BE464016, D63216, F34856, M83822, 
N33937, N49068, N51357, N80486, NM_000954, NM_005756, NM_016652, 

15 R26584, R31178, W05391, W25005^ W45393, W68815, X65965, X76732 and 

Z45051; 

(iii) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleic acid set forth in 
Table 1 and having an Accession Number selected from the group consisting 

20 of:M86849, AW963419, BE298665, AK000637, BE077546, T97307, R24601, 

BE090176, AA393907, W28729, BE313754, AW673081, AA356694, L08239, 
BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, 
NM_002514, AI863735, NM_005397, W26391, H15474, U51166, AA243499, 

25 AW408807. AI738719. AB040888, BE313077, AI677897, C14898, AI821730, 

AF007393, H65423, N46243, AA095971, U20350, NM_005756, D19589, 
AW957446, AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, 
BE464341, AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, 
AW953853, AU076611, AW968613, AL353944, BE614149, AA292998, H12912, 

30 AA1 88763, AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, 

AK001564, AW959861, BE313555, W25005, AI193356, AF111106, AI130740. 
AA985190, BE221880, AF084545, R26584, AW247380, AA364261, U25849, 
AF262992, AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, 
AF251237, NM_000636, AA1 30986, AA216363, AA628980, AA811657, 

35 AA897108, AB040888, AF212225, AI089575, AI282028, AI368826, AI718702, 

AI827248, AK002039, AL1 09791, AW090198, AW296454, AW445034, 
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AW452948, AW470411, AW885727, AW970859, AW979189, BE1 65866, 
BE175582, BE242587, BE271927, BE439580, BE464016, D63216, F34856, 
M83822, N33937, N49068, N51357, N80486, NM_000954, NM_005756, 
NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, X65965, 
X76732andZ45051;and 
(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii)or(iii)or(iv). 

55. The method of claim 54 determining the likelihood that a subject will suffer from a 
recurrence of an ovarian cancer, said method comprising contacting a biological sample 
from said subject being tested with an antibody for a time and under conditions sufficient 
for an antigen-antibody complex to form and then detecting the complex wherein an 
enhanced level of the antigen-antibody complex for the subject being tested compared to 
the amount of the antigen-antibody complex formed for a control subject not having 
ovarian cancer indicates that the subject being tested has a high probability of 
recurrence, and wherein said antibody binds to a polypeptide comprising an amino acid 
sequence comprising at least about 10 contiguous amino acid residues of a sequence 
encoded by a nucleic acid set forth in Table 1 and having an Accession Number selected 
from the group consisting of: M86849, AW963419, BE298665, AK000637, BE077546, 
T97307, R24601, BE090176, AA393907, W28729, BE313754, AW673081, AA356694, 
L08239, BE397649, NM_012317, NM_000947, AJ250562, AL040183, BE207573, 
BE564162, BE439580, AW067800, AA569756, AW138190, AF1 26245, L10343, 
NM_002514, AI863735, NM_005397. W26391, H15474, U51166, AA243499, 
AW408807, AI738719, AB040888, BE313077, AI677897, C14898, AI821730, 
AF007393. H65423, N46243, AA095971, U20350, NM_005756, D19589, AW957446, 
AW294647, BE159718, AI888490, AA022569, BE1 47740, AI798863, BE464341, 
AL080235, AI557212, X75208, AA628980, BE242587, NM_005512, AW953853, 
AU076611, AW968613, AL353944, . BE614149, AA292998, H12912, AA188763, 
AK000596, AI970797, AW519204, Z42387, AF145713, AA972412, AK001564, 
AW959861, BE313555, W25005, AI193356, AF111106, AI130740, AA985190, 
BE221880, AF084545, R26584, AW247380, AA364261, U25849, AF262992, 
AW342140, AL1 33572, AI497778, AI745379, U51712, AW375974, AF251237, 
NM_000636, AA130986, AA216363, AA628980, AA811657, AA897108, AB040888, 
AF212225, AI089575, AI282028, AI368826, AI718702, AI827248, AK002039, 
AL1 09791, AW090198, AW296454, AW445034, AW452948, AW470411, AW885727, 
AW970859, AW979189, BE165866, BE1 75582, BE242587, BE271927, BE439580, 
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BE464016, D63216, F34856, M83822, N33937, N49068, N51357, N80486, NMJ)00954, 
NMJ)05756, NM_016652, R26584, R31178, W05391, W25005, W45393, W68815, 
X65965,X76732andZ45051. . 

5 56. The method according to claim 54 or 55 wherein the ovarian cancer is an epithelial 
ovarian cancer. 

57. The method according to any one of claims 54 to 56 wherein the ovarian cancer is 
selected from the group consisting of serous ovarian cancer, non-invasive ovarian 

10 cancer, mixed phenotype ovarian cancer, mucinous ovarian cancer, endometrioid 
ovarian cancer, clear cell ovarian cancer, papillary serous ovarian cancer, Brenner cell 
and undifferentiated adenocarcinoma. 

58. The method according to claim 57 wherein the ovarian cancer is selected from the 
15 group consisting of serous ovarian cancer, mucinous ovarian cancer, endometrioid 

ovarian cancer and clear cell ovarian cancer. 

59. The method according to any one of claims 35 to 58 wherein the biological sample 
is contacted with a plurality of nucleic acid probes and/or antibodies and/or polypeptides. 

20 

60. A method for identifying a compound that modulates an ovarian cancer- 
associated polypeptide, the method comprising : 

(i) contacting the compound with a ovarian cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence 

25 at least 80% identical to a sequence as shown in Tables 1-3; and 

(ii) determining the functional effect of the compound upon the polypeptide. 

61. A method for determining a candidate compound for the treatment of ovarian 
cancer comprising : 

30 (i) administering a test compound to a mammal having ovarian cancer or a cell 

isolated therefrom; 

(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 
1-3 in a treated cell or mammal with the level of gene expression of the 
35 polynucleotide in a control cell or mammal, wherein a test compound that 
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modulates the level of expression of the polynucleotide is a candidate for the 
treatment of ovarian cancer. 



62. An assay device for use in the diagnosis or prognosis of ovarian cancer, said 
5 device comprising a plurality of polynucleotides immobilized to a solid phase, wherein 

each of said polnucleotides consists of a gene as listed in any one of Tables 1-3. 

63. The device of claim 62 consisting of a substantially planar chip. 

10 64. An assay device for use in the diagnosis or prognosis of ovarian cancer, said 
device comprising a plurality of different antibodies immobilized to a solid phase, wherein 
each of said antibodies binds to a polypeptide listed in Tables 1-3. 

65. The device of claim 64 consisting of a substantially planar chip. 

15 

66. Use of a polynucleotide as set forth in any one of Tables 1-3 in the diagnosis or 
prognosis of ovarian cancer or for the preparation of a medicament for the treatment of 
ovarian cancer. 

20 67. Use of a vector comprising a polynucleotide as set forth in any one of Tables 1-3 
in the diagnosis or prognosis of ovarian cancer or for the preparation of a medicament for 
the treatment of ovarian cancer. 

68. Use of an isolated polypeptide as set forth in any one of Tables 1-3 in the 
25 diagnosis or prognosis of ovarian cancer or for the preparation of a medicament for the 

treatment of ovarian cancer. 

69. Use of an antibody that binds to an isolated polypeptide as set forth in any one of 
Tables 1-3 in the diagnosis or prognosis of ovarian cancer or for the preparation of a 

30 medicament for the treatment of ovarian cancer. 

70. A method of diagnosing an ovarian cancer in a human or animal subject being 
tested said method comprising determining aberrant methylation in a promoter sequence 
that regulates expression of a tumor suppressor gene in a biological sample from said 

35 subject compared to the methylation of the promoter in nucleic acid obtained for a control 
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subject not having ovarian cancer wherein said aberrant methylation indicates that the 
subject being tested has an ovarian ovarian cancer. 

71. The method of claim 70 wherein hypermethylation of the promoter sequence is 
determined. 

72. The method of claim 70 or 71 wherein the methylation is determined in the 
promoter region that regulates expression of an MCC gene comprising a sequence 
selected from the group consisting of: 

(i) the nucleotide sequence set forth as SEQ ID NO: 3; 

(ii) a sequence that hybridizes under at least low stringency hybridization conditions 
to the nucleotide sequence set forth as SEQ ID NO: 3; 

(Hi) a sequence that is at least about 80% identical to (i) or (ii); 

(iv) a sequence that encodes a polypeptide encoded by a nucleotide sequence set 
forth as SEQ ID NO: 3; and 

(v) a sequence that is complementary to any one of the sequences set forth in (i) or 
(ii) or (iii) or (iv). 

73. The method according to any one of claims 70 to 72 wherein the ovarian cancer 
that is diagnosed is an epithelial ovarian cancer. 
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SEQUENCE LISTING 

<110> Garvan Institute of Medical Research 

<120> Methods for the diagnosis and prognosis of ovarian cancer 

<130> 501731/MRO 

<150> AU2002951346 

<151> 2002-09-05 

<160> 84 

<170> Patentln version 3.1 

<210> 1 ■ 

<211> 3432 

<212> DNA 

<213> NMJJ05460 SNCAIP, synphilin 

<220> 

<221> CDS 

<222> (94) 
<223> 



(2850) 



<400> 1 

gtcagtccct tcgcgctcct gagccgccgg cgcgccgggc gccccgggaa tttataagta 60 

tttgaccgta ctcaaaatgt gcaaggaaga ata atg gaa gcc cct gaa tac ctt 114 

• Met Glu Ala Pro Glu Tyr Leu 
1 5 

gat ttg gat gaa att gac ttt agt gat gac ata tct tat tea gtc aca 162 
Asp Leu Asp Glu He Asp Phe Ser Asp Asp He Ser Tyr Ser Val Thr 
10 15 20 

tea etc aag acg ate cca gaa ctg tgc cga aga tgt gat acg caa aac 210 
Ser Leu Lys Thr He Pro Glu Leu Cys Arg Arg Cys Asp Thr Gin Asn 
25 30 35 

gaa gac aga tea get tct age tct age tgg aat tgt ggc ate tea act 258 
Glu Asp Arg Ser Ala Ser Ser Ser Ser Trp Asn Cys Gly He Ser Thr 
40 45 50 55 

ctt att aca aac acg caa aag ccc aca gga ate get gat gtg tac agt 306 
Leu He Thr Asn Thr Gin Lys Pro Thr Gly He Ala Asp Val Tyr Ser 
60 65 70 

aag ttc cgc cca gtg aag egg gtt teg cca ctg aaa cat cag cca gag 354 
Lys Phe Arg Pro Val Lys Arg Val Ser Pro Leu Lys His Gin Pro Glu 
75 80 85 

act ctg gag aac aat gaa agt gat gac caa aag aac cag aaa gtg gtt 402 
Thr Leu Glu Asn Asn Glu Ser Asp Asp Gin Lys Asn Gin Lys Val Val 
90 95 100 

gag tac cag aaa ggg ggt gag tct gac ctg ggc ccc cag cct cag gag 450 
Glu Tyr Gin Lys Gly Gly Glu Ser Asp Leu Gly Pro Gin Pro Gin Glu 
105 110 115 

ctt ggc cct gga gat gga gtg ggc ggc cca cca ggt aag age tct gag 498 
Leu Gly Pro Gly Asp Gly Val Gly Gly Pro Pro Gly Lys Ser Ser Glu 
120 125 130 135 

ccc age aca teg ctg ggt gaa ctg gag cac tac gac etc gac atg gat 546 
Pro Ser Thr Ser Leu Gly Glu Leu Glu His Tyr A3p Leu Asp Met Asp 
140 145 150 
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gag att ctg gat gtg cct tat att aaa tec agt cag cag ctt gec tct 594 
Glu He Leu Asp Val Pro Tyr He Lys Ser Ser Gin Gin Leu Ala Ser 
155 160 . 165 

ttt acc aag gtg act tea gaa aaa aga att ttg ggc tta tgc aca acc 642 
Phe Thr Lys Val Thr Ser Glu Lys Arg He Leu Gly Leu Cys Thr Thr 
170 175 180 

ate aat ggc ctt tct ggc aaa gee tgc tct aca gga agt tct gag age 690 
He Asn Gly Leu Ser Gly Lys Ala Cys Ser Thr Gly Ser Ser Glu Ser 
185 190 195' 

tea tea tec aac atg gca cca ttt tgt gtt ctt tct ccc gtg aaa age 738 
Ser Ser Ser Asn Met Ala Pro Phe Cys Val Leu Ser Pro Val Lys Ser 
200 205 210 215 

cct cac ttg aga aaa gca tea get gtc ate cac gac cag cac aag ctg 786 
Pro His Leu Arg Lys Ala Ser Ala Val He His Asp Gin His Lys Leu 
220 225 230 

tec act gaa gaa acc gag ate tea cct cct ctg gtt aaa tgt ggc tct 834 
Ser Thr Glu Glu Thr Glu He Ser Pro Pro Leu Val Lys Cys Gly Ser 
235 240 245 

gca tat gag cct gaa aac cag agt aaa gac ttc eta aac aag aca ttt 882 
Ala Tyr Glu Pro Glu Asn Gin Ser Lys Asp Phe Leu Asn Lys Thr Phe 
250 255 260 

agt gat cct cat ggt cga aaa gtt gag aag aca aca cca gac tgc cag 930 
Ser Asp Pro His Gly Arg Lys Val Glu Lys Thr Thr Pro Asp Cys Gin 
265 270 275 

etc agg gee ttc cac eta caa tec tea gca gca gaa tec aaa cca gaa 978 
Leu Arg Ala Phe His Leu Gin Ser Ser Ala Ala Glu Ser Lys Pro Glu 
280 285 290 " 295 

gag cag gtc agt ggc eta aac egg acc age tec caa ggc cca gaa gaa 1026 
Glu Gin Val Ser Gly Leu Asn Arg Thr Ser Ser Gin Gly Pro Glu Glu 
300 305 310 

agg agt gag tat ctg aaa aaa gtg aaa age ate ttg aac att gtt aaa 1074 
Arg Ser Glu Tyr Leu Lys Lys Val Lys Ser He Leu Asn He Val Lys 
315 320 325 

gaa gga cag ate tct etc ctg cca cac eta get gca gac aat eta gac 1122 
Glu Gly Gin He Ser Leu Leu Pro His Leu Ala Ala Asp Asn Leu Asp 
330 335 340 

aaa att cac gac gaa aat gga aac aat eta tta cat att gcg gcg tea 1170 
Lys He His Asp Glu Asn Gly Asn Asn Leu Leu His He Ala Ala Ser 
345 350 355 

cag gga cac gca gag tgt eta cag cac etc act tct ttg atg gga gaa 1218 
Gin Gly His Ala Glu Cys Leu Gin His Leu Thr Ser Leu Met Gly Glu 
360 365 370 375 

gac tgc etc aat gag cgc aac act gag aag ttg act cca gca ggc ctg 12 66 
Asp Cys Leu Asn Glu Arg Asn Thr Glu Lys Leu Thr Pro Ala Gly Leu 
380 385 390 

gee att aag aat ggt cag ttg gag tgc gta cgc tgg atg gtg age gaa 1314 
Ala He Lys Asn Gly Gin Leu Glu Cys Val Arg Trp Met Val Ser Glu 
395 400 " " 405 

aca gaa gee att gca gaa ctg agt tgt tct aag gat ttt cca age ctt 1362 
Thr Glu Ala He Ala Glu Leu Ser Cys Ser Lys Asp Phe Pro Ser Leu 
410 415 " 420 
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att cat tac gca ggt tgc tat ggc cag gaa aag att ctt ctg tgg ctt 1410 
lie His Tyr Ala Gly Cys Tyr Gly Gin Glu Lys lie Leu Leu Trp Leu 
425 430 435 

ctt cag ttt atg caa gaa cag ggc ate .teg ttg gat gaa gta gac cag 1458 
Leu Gin Phe Met Gin Glu Gin Gly lie Ser Leu Asp Glu Val Asp Gin 
440 445 450 455 

gat ggc aac agt gec gtt cac gta gec tea cag cat ggc tac ctt gga 1506 
Asp Gly Asn Ser Ala Val His Val Ala Ser Gin His Gly Tyr Leu Gly 
460 465 470 

tgc ata cag acc ttg gtt gaa tat gga gca aat gtc acc atg cag aac 1554 
Cys lie Gin Thr Leu Val Glu Tyr Gly Ala Asn Val Thr Met Gin Asn 
475 480 485 

cac get ggg gaa aag ccc tec cag age gec gag egg cag ggg cac acc 1602 
His Ala Gly Glu Lys Pro Ser Gin Ser Ala Glu Arg Gin Gly His Thr 
490 495 500 

ctg tgc tec agg tac ctg gtg gtg gtg gag acc tgc atg teg ctg gee 1650 
Leu Cys Ser Arg Tyr Leu Val Val Val Glu Thr Cys Met Ser Leu Ala 
505 510 515 

tct caa gtg gtg. aag tta acc aag cag eta aag gaa caa aca gta gaa 1698 
Ser Gin Val Val Lys Leu Thr Lys Gin Leu Lys Glu Gin Thr Val Glu 
520 525 530 535 

cgt gtc acg ctg cag aac caa etc caa caa ttt eta gaa gee cag aaa 1746 
Arg Val Thr Leu Gin Asn Gin Leu Gin Gin Phe Leu Glu Ala Gin Lys 
540 545 550 

tea gag ggc aag tea etc cct tct tea ccc agt tea cca tec tea cct 1794 
Ser Glu Gly Lys Ser Leu Pro Ser Ser Pro Ser Ser Pro Ser Ser Pro . 
555 560 565 

gec tec aga aag tec cag tgg aaa tct cca gat gca gat gat gat tct 1842 
Ala Ser Arg Lys Ser Gin Trp Lys Ser Pro Asp Ala Asp Asp Asp Ser 
570 575 580 

gta gee aaa age aag cca gga gtc caa gag ggg att .cag gtt ctt gga 1890 
Val Ala Lys Ser Lys Pro Gly Val Gin Glu Gly He Gin Val Leu Gly 
585 590 595 

age ctg tea gec tec age egg get aga ccc aaa gca aaa gat gaa gat 1938 
Ser Leu Ser Ala Ser Ser Arg Ala Arg Pro Lys Ala Lys Asp Glu Asp 
600 605 610 * 615 

tct gat aaa ate tta cgc cag tta ttg gga aag gaa ate tea gaa aat 1986 
Ser Asp Lys He Leu Arg Gin Leu Leu Gly Lys Glu He Ser Glu Asn 
620 625 " 630 

gtc tgc acc cag gaa aaa ctg tec ttg gaa ttc cag gat get cag get 2034 
Val Cys Thr Gin Glu Lys Leu Ser Leu Glu Phe Gin Asp Ala Gin Ala 
635 640 ~ 645 

tec tct aga aat tct aaa aag ate cca ctg gag aag agg gaa ctg aag 2082 
Ser Ser Arg Asn Ser Lys Lys He Pro Leu Glu Lys Arg Glu Leu Lys 
650 655 660 

tta gee agg ctg aga cag ctg atg cag agg tea ctg agt gag tct gac 2130 
Leu Ala Arg Leu Arg Gin Leu Met Gin Arg Ser Leu Ser Glu Ser Asp 
665 670 675 

aca gac tec aac aac tct gag gac ccc aag act acc cca gtg agg aag 2178 
Thr Asp Ser Asn Asn Ser Glu Asp Pro Lys Thr Thr Pro Val Arg Lys 
680 685 690 695 
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get gac cga cca agg ccg cag ccc att gta gaa age gta gag agt atg 2226 
Ala Asp Arg Pro Arg Pro Gin Pro He Val Glu Ser Val Glu Ser Met 
700 705 710 

gac age gca gaa age ctg cac ctg atg att aag aaa cac ace ttg gca 2274 
Asp Ser Ala Glu Ser Leu His Leu Met He Lys Lys His Thr Leu Ala 
715 720 725 

tea ggg gga cgc agg ttt cct ttc age ate aag gee tec aaa tec ctg 2322 
Ser Gly Gly Arg Arg Phe Pro Phe Ser He Lys Ala Ser Lys Ser Leu 
730 735 740 

gat ggc cac age cca tct ccc ace tea gag age age gaa cca gac tta 2370 
Asp Gly His Ser Pro Ser Pro Thr Ser Glu Ser Ser Glu Pro Asp Leu 
745 750 755 

gaa tec cag tat cca ggc tea ggg agt att cct cca aac cag ccc tct 2418 
Glu Ser Gin Tyr Pro Gly Ser Gly Ser He Pro Pro Asn Gin Pro Ser 
760 765 770 77.5 

ggt gac cct cag cag ccc age cct gac agt act get gee cag aaa gtt 2466 
Gly Asp Pro Gin Gin Pro Ser Pro Asp Ser Thr Ala Ala Gin Lys Val 
780 785 790 

gee aca agt ccc aag agt gee etc aag tct cca tct tec aag cgt agg 2514 
Ala Thr Ser Pro Lys Ser Ala Leu Lys Ser Pro Ser Ser Lys Arg Arg 
795 800 805 

aca tct cag aac tta aaa ctg aga gtt acc ttt gag gag cct gtg gtg 2562 
Thr Ser Gin Asn Leu Lys Leu Arg Val Thr Phe Glu Glu Pro Val Val 
810 815 820 

cag atg gag cag cct age ctt gaa ctg aat gga gaa aaa gac aaa gat 2610 
Gin Met Glu Gin Pro Ser Leu Glu Leu Asn Gly Glu Lys Asp Lys Asp 
825 830 ' 835 

aag ggc agg act etc cag egg acc tec aca agt aac gaa teg ggg gat 2658 
Lys Gly Arg Thr Leu Gin Arg Thr Ser Thr Ser Asn Glu Ser Gly Asp 
840 845 850 855 

caa ctg aaa agg cct ttt gga gee ttt cga tct ate atg gag aca eta 2706 
Gin Leu Lys Arg Pro Phe Gly Ala Phe Arg Ser He Met Glu Thr Leu 
860 . 865 870 

agt ggc aac caa aac aat aat aat aac tac cag gca gee aac cag ctg 2754 
Ser Gly Asn Gin Asn Asn Asn Asn Asn Tyr Gin Ala Ala Asn Gin Leu 
875 880 885 

aaa acc tct aca ttg ccc ttg acc tea ctt ggg agg aag aca gat gee 2802 
Lys Thr Ser Thr Leu Pro Leu Thr Ser Leu Gly Arg Lys Thr Asp Ala 
890 895 900 

aag gga aac cct gee age tec get age aaa gga aag aat aag gca gca 2850 
Lys Gly Asn Pro Ala Ser Ser Ala Ser Lys Gly Lys Asn Lys Ala Ala 



905 




910 915 




taatgacatc 


aatagaaaaa 


tgaagaaatc ctacagcata aagcacattg ctgagecaga 


2910 


gtcaaaagaa 


ctcttcttgt 


aaatcacttt ttaaattttc tctcactgat gccctttgga 


2970 


aattattgga 


aatttctgga 


ctatcctctt tggaaagaga accatgaaaa caatgcctca 


3030 


ccagcagaag 


aacagaatat 


caggatgect taaatttata gtagtagact gtaaaagatt 


3090 


cattttgggg 


tgatatctgt 


atatataact tgttttttta aaagatgccg tttaaaagca 


3150 


tgattgggaa 


aatgtacgtt 


ttttaagagt agattgattc accctaccca caggacattc 


3210 
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accaagccac tgataccatt ttatatttca tcaattgcat gagtatttgc taatgttgat 3270 

tgaacctccc tttccccata atgtgggcag atttggctca gctccttcat gagatcaggt 3330 

cagtggtatt gtttctgtca agagtgtttt ttctgtcatt tctacttttt gtataaagga 3390 

aataaaacaa tgttaacagc caaaaaaaaa aaaaaaaaaa aa 3432 

<210> 2 

<211> 919 

<212> PRT 

<213> NMJJ05460 SNCAIP, synphilin 



<400> 2 

Met Glu Ala Pro Glu Tyr Leu Asp Leu Asp Glu lie Asp Phe Ser Asp 
1'5 10 15 



Asp lie Ser Tyr Ser Val Thr Ser Leu Lys Thr lie Pro Glu Leu Cys 
20 25 30 



Arg Arg Cys Asp Thr Gin Asn Glu Asp Arg Ser Ala Ser Ser Ser Ser 
35 40 45 



Trp Asn Cys Gly lie Ser Thr Leu lie Thr Asn Thr Gin Lys Pro Thr 
50 55 60 



Gly lie Ala Asp Val Tyr Ser Lys Phe Arg Pro Val Lys Arg Val Ser 
65 70 75 80 



Pro Leu Lys His Gin Pro Glu Thr Leu Glu Asn Asn Glu Ser Asp Asp 
85 90 95 



Gin Lys Asn Gin Lys Val Val Glu Tyr Gin Lys Gly Gly Glu Ser Asp 
100 105 . 110 



Leu Gly Pro Gin Pro Gin Glu Leu Gly Pro Gly Asp Gly Val Gly Gly 
115 120 125 



Pro Pro Gly Lys Ser Ser Glu Pro Ser Thr Ser Leu Gly Glu Leu Glu 
130 135 .140 



His Tyr Asp Leu Asp Met Asp Glu lie Leu Asp Val Pro Tyr lie Lys 
145 150 155 160 



Ser Ser Gin Gin Leu Ala Ser Phe Thr Lys Val Thr Ser Glu Lys Arg 
165 170 175 



lie Leu Gly Leu Cys Thr Thr lie Asn Gly Leu Ser Gly Lys Ala Cys 
180 185 190 
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Ser Thr Gly Ser Ser Glu Ser Ser Ser Ser Asn Met Ala Pro Phe Cys 
195 200 205 



Val Leu Ser Pro Val Lys Ser Pro His Leu Arg Lys Ala Ser Ala Val 
210 215 220 



lie His Asp Gin His Lys Leu Ser Thr Glu Glu Thr Glu lie Ser Pro 
225 230 235 240 



Pro Leu Val Lys Cys Gly Ser Ala Tyr Glu Pro Glu Asn Gin Ser Lys 
245 250 255 



Asp Phe Leu Asn Lys Thr Phe Ser Asp Pro His Gly Arg Lys Val Glu 
260 265 270 



Lys Thr Thr Pro Asp Cys Gin Leu Arg Ala Phe His Leu Gin Ser Ser 
275 280 285 



Ala Ala Glu Ser Lys Pro Glu Glu Gin Val Ser Gly Leu Asn Arg Thr 
290 295 300 ■ 



Ser Ser Gin Gly Pro Glu Glu Arg Ser Glu Tyr Leu Lys Lys Val Lys 
305 310 315 ' 320 



Ser He Leu Asn He Val Lys Glu Gly Gin He Ser Leu Leu Pro His 
325 330 335 



Leu Ala Ala Asp Asn Leu Asp Lys lie His Asp Glu Asn Gly Asn Asn 
340 345 350 



Leu Leu His He Ala Ala Ser Gin Gly His Ala Glu Cys Leu Gin His 
355 360 365 



Leu Thr Ser Leu Met Gly Glu Asp Cys Leu Asn Glu Arg Asn Thr Glu 
370 375 380 



Lys Leu Thr Pro Ala Gly Leu Ala He Lys Asn Gly Gin Leu Glu Cys 
385 390 395 400 



Val Arg Trp Met Val Ser Glu Thr Glu Ala He Ala Glu Leu Ser Cys 
405 410 415 



Ser Lys Asp Phe Pro Ser Leu He His Tyr Ala Gly Cys Tyr Gly Gin 
420 425 430 



Glu Lys He Leu Leu Trp Leu Leu Gin Phe Met Gin Glu Gin Gly He 
435 440 445 



Ser Leu Asp Glu Val Asp Gin Asp Gly Asn Ser Ala Val His Val Ala 
450 455 460 
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Ser Gin His Gly Tyr Leu Gly Cys lie Gin Thr Leu Val Glu Tyr Gly 
465 470 ' 475 480 



Ala Asn Val Thr Met Gin Asn His Ala Gly Glu Lys Pro Ser Gin Ser 
485 490 495 



Ala Glu Arg Gin Gly His Thr Leu Cys Ser Arg Tyr Leu Val Val Val 
500 505 510 



Glu Thr Cys Met Ser Leu Ala Ser Gin Val Val Lys Leu Thr Lys Gin 
515 520 525 



Leu Lys Glu Gin Thr Val Glu Arg Val Thr Leu Gin Asn Gin Leu Gin 
530 535 540 



Gin Phe Leu Glu Ala Gin Lys Ser Glu Gly Lys Ser Leu Pro Ser Ser 
545 550 555 560 



Pro Ser Ser Pro Ser Ser Pro Ala Ser Arg Lys Ser Gin Trp Lys Ser 
565 570 575 

Pro Asp Ala Asp Asp Asp Ser Val Ala Lys Ser Lys Pro Gly Val Gin 
580 585 590 

Glu Gly He Gin Val Leu Gly Ser Leu Ser Ala Ser Ser Arg Ala Arg 
595 600 605 

Pro Lys Ala Lys Asp Glu Asp Ser Asp Lys He Leu Arg Gin Leu Leu 
610 615 620 



Gly Lys Glu He Ser Glu Asn Val Cys Thr Gin Glu Lys Leu Ser Leu 
625 630 635 640 



Glu Phe Gin Asp Ala Gin Ala Ser Ser Arg Asn Ser Lys Lys He Pro 
645 650 " 655 



Leu Glu Lys Arg Glu Leu Lys Leu Ala Arg Leu Arg Gin Leu Met Gin 
660 665 670 



Arg Ser. Leu Ser Glu Ser Asp Thr Asp Ser Asn Asn Ser Glu Asp Pro 
675 680 685 



Lys Thr Thr- Pro Val Arg Lys Ala Asp Arg Pro Arg Pro Gin Pro He 
690 695 700 



Val Glu Ser Val Glu Ser Met Asp Ser Ala Glu Ser Leu His Leu Met 
705 710 715 720 



He Lys Lys His Thr Leu Ala Ser Gly Gly Arg Arg Phe Pro Phe Ser 
725 730 735 
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lie Lys Ala Ser Lys Ser Leu Asp Gly His Ser Pro Ser Pro Thr Ser 
740 745 750 



Glu Ser Ser Glu Pro Asp Leu Glu Ser Gin Tyr Pro Gly Ser Gly Ser 
755 760 765 



lie Pro Pro Asn Gin Pro Ser Gly Asp Pro Gin Gin Pro Ser Pro Asp 
770 775 780 



Ser Thr Ala Ala Gin Lys Val Ala Thr Ser Pro Lys Ser Ala Leu Lys 
785 790 795 800 



Ser Pro Ser Ser Lys Arg Arg Thr Ser Gin Asn Leu Lys Leu Arg Val 
805 810 815 



Thr Phe Glu Glu Pro Val Val Gin Met Glu Gin Pro Ser Leu Glu Leu 
820 825 830 



Asn Gly Glu Lys Asp Lys Asp Lys Gly Arg Thr Leu Gin Arg Thr Ser 
835 840 845 



Thr Ser Asn Glu Ser Gly Asp Gin Leu Lys Arg Pro Phe Gly Ala Phe 
850 855 860 



Arg Ser lie Met Glu Thr Leu Ser Gly Asn Gin Asn Asn Asn Asn Asn 
865 870 875 880 



Tyr Gin Ala Ala Asn Gin Leu Lys Thr Ser Thr Leu Pro Leu Thr Ser 
885 890 895 



Leu Gly Arg Lys Thr Asp Ala Lys Gly Asn Pro Ala Ser Ser Ala Ser 
900 905 910 



Lys Gly Lys Asn Lys Ala Ala 
915 



<210> 3 

<211> 4181 

<212> DNA 

<213> NMJ)02387 MCC, mutated in colorectal cancers 
<220> 

<221> CDS 

<222> (221) . . (2707) 

<223> 
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<400> 3 

cctcctgcag caatggctcg tccgtgaaac gcgagccacg gctgctcttt ttaagagtgc 60 

ctgcatcctc cgtttgcgct tcgcaactgt cctgggtgaa aatggctgtc tagactaaaa 120 

tgtggcagaa gggaccaagc agtggatatt gagcctgtga agtccaactc ttaagctccg 180 

agacctgggg gactgagagc ccagctctga aaagtgcatc atg aat tec gga gtt 235 

Met Asn Ser Gly Val 
1 5 

gcc atg aaa tat gga aac gac tec teg gcc gag ctg agt gag etc cat 283 
Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala. Glu Leu Ser Glu Leu His 
10 15 ' 20 

tea gca gcc ctg gca tea eta aag gga gat ata gtg gaa ctt aat aaa 331 
Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp He Val Glu Leu Asn Lys 
25 30 35 

cgt etc cag caa aca gag agg gaa egg gac ctt ctg gaa aag aaa ttg 379 
Arg Leu Gin Gin Thr Glu Arg Glu Arg Asp Leu Leu Glu Lys Lys Leu 
40 45 50 

gcc aag gca cag tgc gag cag tec cac etc atg aga gag cat gag gat 427 
Ala Lys Ala Gin Cys Glu Gin Ser His Leu Met Arg Glu His Glu Asp 
55 60 65 

gtc cag gag cga acg acg ctt cgc tat gag gaa cgc ate aca gag etc 475 
Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg He Thr Glu Leu 
70 75 80 85 

cac age gtc att gcg gag etc aac aag aag ata gac cgt ctg caa ggc 523 
His Ser Val He Ala Glu Leu Asn Lys Lys He Asp Arg Leu Gin Gly 
90 95 100 

ace ace ate agg gag gaa gat gag tac tea gaa ctg cga tea gaa etc 571 
Thr Thr He Arg Glu Glu Asp Glu Tyr Ser Glu Leu Arg Ser Glu Leu 
105 no us 

age cag age caa cac gag gtc aac gag gac tct cga age atg gac caa 619 
Ser Gin Ser Gin His Glu Val Asn Glu Asp Ser Arg Ser Met Asp Gin 
120 125 130 

gac cag acc tct gtc tct ate ccc gaa aac cag tct acc atg gtt act 
Asp Gin Thr Ser Val Ser He Pro Glu Asn Gin Ser Thr Met Val Thr 
135 140 145 



667 



get gac atg gac aac tgc agt gac ctg aac tea gaa ctg cag agg gtg 715 
Ala Asp Met Asp Asn Cys Ser Asp Leu Asn Ser Glu Leu Gin Arg Val 
150 155 160 165 

ctg aca ggg ctg gag aat gtt gtc tgc ggc agg aag aag age age tgc 763 
Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg Lys Lys Ser Ser Cys 
170 175 180 

age etc tec gtg gcc gag gtg gac agg cac att gag cag etc acc aca 811 
Ser Leu Ser Val Ala Glu Val Asp Arg His He Glu Gin Leu Thr Thr 
185 190 195 

gcc age gag cac tgt gac ctg get att aag aca gtc gag gag att gag 859 
Ala Ser Glu His Cys Asp Leu Ala He Lys Thr Val Glu Glu He Glu 
200 205 210 

ggg gtg ctt ggc egg gac ctg tat ccc aac ctg get gaa gag agg tct 907 
Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu Ala Glu Glu Arg Ser 
215 220 225 
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egg tgg gag aag gag ctg get ggg ctg agg gaa gag aat gag age ctg 955 
Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu Asn Glu Ser Leu 
230 235 240 245 

act gee atg ctg tgc age aaa gag gaa gaa ctg aac egg act aag gee 1003 
Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn Arg Thr Lys Ala 
250 255 260 

ace atg aat gee ate egg gaa gag egg gac egg etc egg agg egg gtc 1051 
Thr Met Asn Ala lie Arg Glu Glu Arg Asp Arg Leu Arg Arg Arg Val 
265 270 275 

aga gag ctt caa act cga eta cag age gtg cag gee aca ggt ccc tec 1099 
Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin Ala Thr Gly Pro Ser 
280 285 290 

age cct ggc cgc etc act tec ace aac cgc ccg att aac ccc age act 1147 
Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro lie Asn Pro Ser Thr 
295 300 305 

ggg gag ctg age aca age age age age aat gac att ccc ate gee aag 1195 
Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp lie Pro lie Ala Lys 
310 315 320 325 

att get gag agg gtg aag eta tea aag aca agg tec gaa teg tea tea 1243 
lie Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser Glu Ser Ser Ser 
330 335 340 

tct gat egg cca gtc ctg ggc tea gaa ate agt age ata ggg gta tec 1291 
Ser Asp Arg Pro Val Leu Gly Ser Glu lie Ser Ser lie Gly Val Ser 
345 " . 350 355 

age agt gtg get gaa cac ctg gee cac tea ctt cag gac tgc tec aat 1339 
Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gin Asp Cys Ser Asn 
360 365 370 

ate caa gag att ttc caa aca etc tac tea cac gga tct gee ate tea 1387 
lie Gin Glu lie Phe Gin Thr Leu Tyr Ser His Gly Ser Ala lie Ser 
375 380 385 

gaa age aag att aga gag ttt gag gtg gaa aca gaa egg ctg aat age 1435 
Glu Ser Lys lie Arg Glu Phe Glu Val Glu Thr Glu Arg Leu Asn Ser 
390 395 400 405 

egg att gag cac etc aaa tec caa aat gac etc ctg ace ata ace ttg 1483 
Arg lie Glu His Leu Lys Ser Gin Asn Asp Leu Leu Thr lie Thr Leu 
410 415 420 

gag gaa tgt aaa age aat get gag agg atg age atg ctg gtg gga aaa 1531 
Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met Leu Val Gly Lys 
425 430 435 

tac gaa tec aat gee aca gcg ctg agg ctg gee ttg cag tac age gag 1579 
Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu Gin Tyr Ser Glu 
440 445 450 

cag tgc ate gaa gee tac gaa etc etc ctg gcg ctg gca gag agt gag 1627 
Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala Leu Ala Glu Ser Glu 
455 460 465 

cag age etc ate ctg ggg cag ttc cga gcg gcg ggc gtg ggg tec tec 1675 
Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala Gly Val Gly Ser Ser 
470 475 480 485 

cct gga gac cag teg ggg gat gaa aac ate act cag atg etc aag cga 1723 
Pro Gly Asp Gin Ser Gly Asp Glu Asn lie Thr Gin Met Leu Lys Arg 
490 . 495 500 
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get cat gac tgc egg aag aca get gag aac get gee aag gee ctg etc 1771 
Ala His Asp Cys Arg I>ys Thr Ala Glu Asn Ala Ala Lys Ala Leu Leu 
505 510 515 

atg aag ctg gac ggc age tgt ggg gga gee ttt gee gtg gee ggc tgc 1819 
Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala Val Ala Gly Cys 
520 * * 525 530 

age gtg cag ccc tgg gag age ctt tec. tec aac age cac acc age aca .1867 
Ser Val Gin Pro Trp Glu Ser Leu Ser Ser Asn Ser His Thr Ser Thr 
535 540 545 

acc age tec aca gec agt agt tgc gac acc gag ttc act aaa gaa gac 1915 
Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe Thr Lys Glu Asp 
550 555 560 565 

gag cag agg ctg aag gat tat ate cag cag etc aag aat gac agg get 1963 
Glu Gin Arg Leu Lys Asp Tyr He Gin Gin Leu Lys Asn Asp Arg Ala 
570 575 580 

gcg gtc aag ctg acc atg ctg gag ctg gaa age ate cac ate gat cct 2011 
Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser He His He Asp Pro 
585 590 595 

etc age tat gac gtc aag cct egg gga gac age cag agg ctg gat ctg 2059 
Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser Gin Arg Leu Asp Leu 
600 605 610 

gaa aac gca gtg ctt atg cag gag etc atg gec atg aag gag gag atg 2107 
Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala Met Lys Glu Glu Met 
615 620 625 

gec gag ttg aag gec cag etc tac eta ctg gag aaa gag aag aag gec 2155 
Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu Lys Glu Lys Lys Ala 
630 635 640 645 

ctg gag ctg aag ctg age acg egg gag gec cag gag cag gec tac ctg 2203 
Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin Glu Gin Ala Tyr Leu 
650 655 660 

gtg cac att gag cac ctg aag tec gag gtg gag gag cag aag gag cag 2251 
Val His He Glu His Leu Lys Ser Glu Val Glu Glu Gin Lys Glu Gin 
665 670 675 

egg atg cga tec etc age tec acc age age ggc age aaa gat aaa cct 2299 
Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser Lys Asp Lys Pro 
680 685 690 

ggc aag gag tgt get gat get gec tec cca get ctg tec eta get gaa 2347 
Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu Ser Leu Ala Glu 
695 700 705 

etc agg aca acg tgc age gag aat gag ctg get gcg gag ttc acc aac 2395 
Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala Glu Phe Thr Asn 
710 715 720 725 

gec att cgt cga gaa aag aag ttg aag gec aga gtt caa gag ctg gtg 2443 
Ala He Arg Arg Glu Lys Lys Leu Lys Ala Arg Val Gin Glu Leu Val 
730 735 740 

agt gec ttg gag aga etc acc aag age agt gaa ate cga cat cag caa 2491 
Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu He Arg His Gin Gin 
745 750 755 

tct gca gag ttc gtg aat gat eta aag egg gec aac age aac ctg gtg 2539 
Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn Ser Asn Leu Val 
760 765 770 



WO 2004/022778 



12 



PCT/AU2003/001166 



get gec tat gag aaa gca aag aaa aag cat caa aac aaa ctg aag aag 2587 

Ala Ala Tyr Glu Lys Ala Lys Lys Lys .His Gin Asn Lys Leu Lys Lys 

775 780 785 

tta gag teg cag atg atg gec atg gtg gag aga cat gag acc caa gtg 2635 

Leu Glu Ser Gin Met Met Ala Met Val Glu Arg His Glu Thr Gin Val 
790 795 800 805 

agg atg etc aag caa aga ata get ctg eta gag gag gag aac tec agg 2683 

Arg Met Leu Lys Gin Arg lie Ala Leu Leu Glu Glu Glu Asn Ser Arg 
810 815 . 820 

cca cac acc aat gaa act teg ctt taatcagcac tcacgcaccg gagttctgee 2737 
Pro His Thr Asn Glu Thr Ser Leu 
825 



catgggaagt 


aaactgeage 


aggecactgg 


ggacagaagg 


gcccatgtac 


ttgttgggag 


2797 


gaggaggaaa 


gggaaggctg 


gcaggtaggt 


eggcacttgg 


acaatggagt 


gccccaactc 


2857 


aacccttggg 


gtgactggcc 


atggtgacat 


tgtggactgt 


atccagaggt 


gcccgctctt 


2917 


ccctcctggg 


cccacaacag 


cgtgtaaaca 


catgttctgt 


gcctgctcag 


cagagcctcg 


2977 


tttctgcttt 


cagcactcac 


tctccccctc 


ctcttctggt 


ctggcggctg 


tgcatcagtg 


3037 


ggatcccaga 


catttgtttc 


tgtaagattt 


tccattgtat 


cctctttttg 


gtagatgctg 


3097 


ggctcatctt 


ctagaatctc 


gtttctcctc 


tttcctcctg 


cttcatggga 


aaacagacct 


3157 


gtgtgtgcct 


ccagcattta 


aaaggactgc 


tgatttgttt 


actacagcaa 


ggctttggtt 


3217 


tccaagtccc 


gggtctcaac 


tttaagatag 


aggeggecat 


aagaggtgat 


ctctgggagt 


3277 


tataggtcat 


gggaagagcg 


tagacaggtg 


ttacttacag 


tcccagatac 


actaaagtta 


3337 


caaacagacc 


accaccagga 


ctgtgcctga 


acaattttgt 


attgagagaa 


taaaaacttc 


3397 


cttcaatctt 


cattttggag 


gcagggctgg 


gaagggagcg 


ctctcttgat 


tctgggattt 


3457 


ctccctctca 


gtggagcctt 


attaatatcc 


aagacttaga 


gctgggaatc 


tttttgatac 


3517 


ctgtagtgga 


actaaaattc 


tgtcaggggt 


ttcttcaaga 


gctgagaaac 


attattagca 


3577 


cttcccgccc 


cagggcacta 


cataattget 


gttctgctga 


atcaaatctc 


ttccacatgg 


3637 


gtgcatttgt 


agctctggac 


ctgtctctac 


ctaaggacaa 


gacactgagg 


agatactgaa 


3697 


cattttgeaa 


aacttatcac 


gectacttaa 


gagtgctgtg 


taacccccag 


ttcaagactt 


3757 


agctcctgtt 


gtcatgaegg 


ggacagagtg 


agggaatggt 


agttaaggct 


tettttttge 


3817 


ccccagatac 


atggtgatgg 


ttagcatatg 


gtgcttaaaa 


ggttaaattt 


caagcaaaat 


3877 


gcttacaggg 


ctaggcagta 


ccaaagtaac 


tgaattattt 


caggaaggtc 


ttcaatctta 


3937 


aaacaaattc 


attattcttt 


ttcagtttta . 


cctcttctct 


ctcagttcta 


cactgataca 


3997 


cttgaaggac 


catttactgt 


ttttttctgt 


agcaccagag 


aatccatcca 


aagttcccta 


4057 


tgaaaaatgt 


gttccattgc 


catagctgac 


tacaaattaa 


agttgaggag 


gtttctgeat 


4117 


agagtcttta 


tgtccataag 


etaegggtag 


gtctattttc 


agagcatgat 


acaaattcca 


4177 


cagg 












4181 
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<210> 4 

,<211> 829 

<212> PRT 

<213> NM_002387 MCC, mutated in colorectal cancers 



<400> 4 

Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu 
1 5 .10 15 



Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp lie 
20 25 30 



Val Glu Leu Asn Lys Arg Leu Gin Gin Thr Glu Arg Glu Arg Asp Leu 
35 40 45 



Leu Glu Lys Lys Leu Ala Lys Ala Gin Cys Glu Gin Ser His Leu Met 
50 55 60 



Arg Glu His Glu Asp Val Gin Glu Arg Thr Thr Leu Arg Tyr Glu Glu 
65 * 70 75 ~ 80 



Arg lie Thr Glu Leu His Ser Val lie Ala Glu Leu Asn Lys Lys lie 
85 90 95 

Asp Arg Leu Gin Gly Thr Thr lie Arg Glu Glu Asp Glu Tyr Ser Glu 
100 105 110 

Leu Arg Ser Glu Leu Ser Gin Ser Gin His Glu Val Asn Glu Asp Ser 
115 120 125 

Arg Ser Met Asp Gin Asp Gin Thr Ser Val Ser lie Pro Glu Asn Gin 
130 135 140 



Ser Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp Leu Asn Ser 
145 150 * 155 160 



Glu Leu Gin Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg 
165 170 175 



Lys Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His lie 
180 185 190 

Glu Gin Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala lie Lys Thr 
195 200 205 

Val Glu Glu lie Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu 
210 215 220 
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Ala Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu 
225 230 235 240 



Glu Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu 
245 250 255 



Asn Arg Thr Lys Ala Thr Met Asn Ala He Arg Glu Glu Arg Asp Arg 
260 265 270 



Leu Arg Arg Arg Val Arg Glu Leu Gin Thr Arg Leu Gin Ser Val Gin 
275 280 285 



Ala Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro 
290 295 300 



He Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp 
3.05 310 315 320 



He Pro He Ala Lys He Ala Glu Arg Val Lys Leu Ser Lys Thr Arg 
325 330 335 



Ser Glu Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu He Ser 
340 345 350 



Ser He Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu 
355 360 365 



Gin Asp Cys Ser Asn He Gin Glu He Phe Gin Thr Leu Tyr Ser His 
370 375 380 



Gly Ser Ala He Ser Glu Ser Lys He Arg Glu. Phe Glu Val Glu Thr 
385 390 395 400 



Glu Arg Leu Asn Ser Arg He Glu His Leu Lys Ser Gin Asn Asp Leu 
405 410 415 



Leu Thr He Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser 
420 425 430 



Met Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala 
435 440 445 



Leu Gin Tyr Ser Glu Gin Cys He Glu Ala Tyr Glu Leu Leu Leu Ala 
450 455 460 



Leu Ala Glu Ser Glu Gin Ser Leu He Leu Gly Gin Phe Arg Ala Ala 
465 470 475 480 



Gly Val Gly Ser Ser Pro Gly Asp Gin Ser Gly Asp Glu Asn He Thr 
485 490 • 495 
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Gin Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala 
500 505 510 



Ala Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe 
515 520 525 



Ala Val Ala Gly Cys Ser Val Gin Pro Trp Glu Ser Leu Ser Ser Asn 
530 535 540 



Ser His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu 
545 " 550 555 560 



Phe Thr Lys Glu Asp Glu Gin Arg Leu Lys Asp Tyr He Gin Gin Leu 
565 570 575 



Lys Asn Asp Arg Ala Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser 
580 585 590 



He His He. Asp Pro Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser 
595 600 605 



Gin Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gin Glu Leu Met Ala 
610 615 620 



Met Lys Glu Glu Met Ala Glu Leu Lys Ala Gin Leu Tyr Leu Leu Glu 
625 630 635 640 



Lys Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gin 
645 650 655 



Glu Gin Ala Tyr Leu Val His He Glu His Leu Lys Ser Glu Val Glu 
660 665 670 



Glu Gin Lys Glu Gin Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly 
675 680 685 



Ser Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala 
690 695 700 



Leu Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala 
705 710 715 720 



Ala Glu Phe Thr Asn Ala He Arg Arg Glu Lys Lys Leu Lys Ala Arg 
725 730 735 



Val Gin Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu 
740 745 750 



He Arg His Gin Gin Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala 
755 760 765 
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Asn Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gin 
770 775 780 



Asn Lys Leu Lys Lys Leu Glu Ser Gin Met Met Ala Met Val Glu Arg 
785 790 795 800 



His Glu Thr Gin Val Arg Met Leu Lys Gin Arg He Ala Leu Leu Glu 
805 810 815 

Glu Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu 
820 825 



<210> 5 

<211> 2830 

<212> DNA 

<213> NM_022117 SE20-4 
<220> 

<221> CDS 

<222> (130) .. (2208) 

<223> 



<400> 5 

aattcggcac gaggagagct ggttgcgtga gtctcctcag ctctgcttac cggtgcgact 60 

agcggcagcg acgcggctaa aagcgaaggg gcgagtgcga gtcccctgag ctgtacgaac 120 

gcggtcgcc atg gac cgc cca gat gag ggg cct ccg gcc aag acc cgc cgc 171 
Met Asp Arg Pro Asp Glu Gly Pro Pro Ala Lys Thr Arg Arg 
1 5 10 

ctg age age tec gag tct cca cag cgc gac ccg ccc ccg ccg ccg ccg 219 
Leu Ser Ser Ser Glu Ser Pro Gin Arg Asp Pro Pro Pro Pro Pro Pro 
15 20 25 30 

ccg ccg ccg etc etc cga ctg ccg ctg cct cca ccc cag cag cgc ccg 267 
Pro Pro Pro Leu Leu Arg Leu Pro Leu Pro Pro Pro Gin Gin Arg Pro 
35 40 45 

agg etc cag gag gaa acg gag gcg gca cag gtg ctg gcc gat atg agg 315 
Arg Leu Gin Glu Glu Thr Glu Ala Ala Gin Val Leu Ala Asp Met Arg 
50 55 60 

ggg gtg gga ctg ggc ccc gcg ctg ccc ccg ccg cct ccc tat gtc att 363 
Gly Val Gly Leu Gly Pro Ala Leu Pro- Pro Pro Pro Pro Tyr Val He 
65 70 75 

etc gag gag ggg ggg ate cgc gca tac ttc acg etc ggt get gag tgt 411 
Leu Glu Glu Gly Gly He Arg Ala Tyr Phe Thr Leu Gly Ala Glu Cys 
8.0 85 90 

ccc ggc tgg gat tct acc ate gag teg ggg tat ggg gag gcg ccc ccg 459 
Pro Gly Trp Asp Ser Thr He Glu Ser Gly Tyr Gly Glu Ala Pro Pro 
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95 100 105 110 

ccc acg gag age ctg gaa gca etc ccc act cct gag gec teg ggg ggg 507 
Pro Thr Glu Ser Leu Glu Ala Leu Pro. Thr Pro Glu Ala Ser Gly Gly 
115 120 125 

age ctg gaa ate gat ttt cag gtt gta cag teg age agt ttt ggt gga 555 
Ser Leu Glu lie Asp Phe Gin Val Val Gin Ser Ser Ser Phe Gly Gly 
130 135 140 

gag ggg gee eta gaa acc tgt age gca gtg ggg tgg gcg ccc cag agg 603 
Glu Gly Ala Leu Glu Thr Cys Ser Ala Val Gly Trp Ala Pro Gin Arg 
145 150 155 

tta gtt gac ccg aag age aag gaa gag gcg ate ate ata gtg gag gat 651 
Leu Val Asp Pro Lys Ser Lys Glu Glu Ala lie He He Val Glu Asp 
160 165 170 

gag gat gag gat gag egg gag agt atg agg age age agg agg egg egg 699 
Glu Asp Glu Asp Glu Arg Glu Ser Met Arg Ser Ser Arg Arg Arg Arg 
175 180 185 190 

egg egg egg agg agg aag cag agg aag gtg aag agg gaa age aga gag 747 
Arg Arg Arg Arg Arg Lys Gin Arg Lys Val Lys Arg Glu Ser Arg Glu 
195 200 205 

aga aat gee gag agg atg gag age ate ctg cag gca ctg gag gat att 795 
Arg Asn Ala Glu Arg Met Glu Ser He Leu Gin Ala Leu Glu Asp He 
210 215 220 

cag ctg gat ctg gag gca gtg aac ate aag gca ggc aaa gee ttc ctg 843 
Gin Leu Asp Leu Glu Ala Val Asn He Lys Ala Gly -Lys Ala Phe Leu 
225 230 235 

cgt etc aag cgc aag ttc ate cag atg cga aga ccc ttc ctg gag cgc 891 
Arg Leu Lys Arg Lys Phe lie Gin Met Arg Arg Pro Phe Leu Glu Arg 
240 245 250 

aga gac etc ate ate cag cat ate cca ggc ttc tgg gtc aaa gca ttc 939 
Arg Asp Leu He He Gin His He Pro Gly Phe Trp Val Lys Ala Phe 
255 260 265 270 

etc aac cac ccc aga att tea att ttg ate aac cga cgt gat gaa gac 987 
Leu Asn His Pro Arg He Ser He Leu He Asn Arg Arg Asp Glu Asp 
275 280 285 

att ttc cgc tac ttg acc aat ctg cag gta cag gat etc aga cat ate 1035 
He Phe Arg Tyr Leu Thr Asn Leu Gin Val Gin Asp Leu Arg His He 
290 295 300 

tec atg ggc tac aaa atg aag ctg tac ttc cag act aac ccc tac ttc 1083 
Ser- Met Gly Tyr Lys Met Lys Leu Tyr Phe Gin Thr Asn Pro Tyr Phe 
305 310 315 

aca aac atg gtg att gtc aag gag ttc cag cgc aac cgc tea ggc egg 1131 
Thr Asn Met Val He Val Lys Glu Phe Gin Arg Asn Arg Ser Gly Arg 
320 325 330 

ctg gtg tct cac tea acc cca ate cgc tgg cac egg ggc cag gaa ccc 1179 
Leu Val Ser His Ser Thr Pro He Arg Trp His Arg Gly Gin Glu Pro 
335 340 * 345 350 

cag gee cgt cgt cac ggg aac cag gat gcg age cac age ttt ttc age 1227 
Gin Ala Arg Arg His Gly Asn Gin Asp Ala .Ser His Ser Phe Phe Ser 
355 360 365 

tgg ttc tea aac cat age etc cca gag get gac agg att get gag att 1275 
Trp Phe Ser Asn His Ser Leu Pro Glu Ala Asp Arg He Ala Glu He 
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370 375 . 380 

ate aag aat gat ctg tgg gtt aac cct eta cgc tac tac ctg aga gaa 1323 
lie Lys Asn Asp Leu Trp Val Asn Pro Leu Arg Tyr Tyr Leu Arg Glu 
385 ' 390 395 

a<J9f ggc tec agg ata aag aga aag aag caa gaa atg aag aaa cgt aaa 1371 
Arg Gly Ser Arg He Lys Arg Lys Lys Gin Glu Met Lys Lys Arg Lys 
400 405 410 

ace agg ggc aga tgt gag gtg gtg ate atg gaa gac gee cct gac tat 1419 
Thr Arg Gly Arg Cys Glu Val Val He Met Glu Asp Ala Pro Asp Tyr 
415 420 425 430 

tat gca gtg gaa gac att ttc age gag ate tea gac att gat gag aca 1467 
Tyr Ala Val Glu Asp He Phe Ser Glu He Ser Asp He Asp Glu Thr 
435 440 445 

att cat gac ate aag ate tct gac ttc atg gag ace ace gac tac ttc 1515 
He His Asp He Lys He Ser Asp Phe Met Glu Thr Thr Asp Tyr Phe 
450 455 460 

gag ace act gac aat gag ata act gac ate aat gag aac ate tgc gac 1563 
Glu Thr Thr Asp Asn Glu He Thr Asp lie Asn Glu Asn He Cys Asp 
465 470 475 

age gag aat cct gac cac aat gag gtc ccc aac aac gag ace act gat 1611 
Ser Glu Asn Pro Asp His Asn Glu Val Pro Asn Asn Glu Thr Thr Asp 
480 485 490 

aac aac gag agt get gat gac cac gaa ace act gac aac aat gag agt 1659 
Asn Asn Glu Ser Ala Asp Asp His Glu Thr Thr Asp Asn Asn Glu Ser 
495 500 ~ 505 " 510 

gca gat gac aac aac gag aat cct gaa gac aat aac aag aac act gat 1707 
Ala Asp Asp Asn Asn Glu Asn Pro Glu Asp Asn Asn Lys Asn Thr Asp 
515 520 525 

gac aac gaa gag aac cct aac aac aac gag aac act tac ggc aac aac 1755 
Asp Asn Glu Glu Asn Pro Asn Asn Asn Glu Asn Thr Tyr Gly Asn Asn 
530 535 540 

ttc ttc aaa ggt ggc ttc tgg ggc age cat ggc aac aac cag gac age 1803 
Phe Phe Lys Gly Gly Phe Trp Gly Ser His Gly Asn Asn Gin Asp Ser 
545 550 555 

age gac agt gac aat gaa gca gat gag gee agt gat gat gaa gat aat 1851 
Ser Asp Ser Asp Asn Glu Ala Asp Glu Ala Ser Asp Asp Glu Asp Asn 
560 565 570 

gat ggc aac gaa ggt gac aat gag ggc agt gat gat gat ggc aat gaa 1899 
Asp Gly Asn Glu Gly Asp Asn Glu Gly Ser Asp Asp Asp Gly Asn Glu 
575 580 585 590 

ggt gac aat gaa ggc age gat gat gac gac aga gac att gag tac tat 1947 
Gly Asp Asn Glu Gly Ser Asp Asp Asp Asp Arg Asp He Glu Tyr Tyr 
595 600 605 

gag aaa gtt att gaa gac ttt gac aag gat cag get gac tac gag gac 1995 
Glu Lys Val He Glu Asp Phe Asp Lys Asp Gin Ala Asp Tyr Glu Asp 
610 615 620 

gtg ata gag ate ate tea gac gaa tea gtg gaa gaa gag ggc att gag 2043 
Val lie Glu lie lie Ser Asp Glu Ser Val Glu Glu Glu Gly lie Glu 
625 630 635 

gaa ggc ate cag caa gat gag gac ate tat gag gaa gga aac tat gag 2091 
Glu Gly He Gin Gin Asp Glu Asp lie Tyr Glu Glu Gly Asn Tyr Glu 
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640 645 650 

gag gaa gga agt gaa gat gtc tgg gaa gaa ggg gaa gat teg gac gac 2139 
Glu Glu Gly Ser Glu Asp Val Trp Glu Glu Gly Glu Asp Ser Asp Asp 
655 660 • 665 670 

tct gac eta gag gat gtg ctt cag gtc cca aac ggt tgg gec aat ccg 2187 
Ser Asp Leu Glu Asp Val Leu Gin Val Pro Asn Gly Trp Ala Asn Pro 
675 680 685 

ggg aag agg ggg aaa acc gga taagggtttt ccccttttgg ggatcacctc 2238 
Gly Lys Arg Gly Lys Thr Gly 
690 

tctgtatccc ccacccacta tcccatttgc cctcctcctc agetagggee acgcggcccc 22 98 

acattgeact tctggggggt gaccgacttc gtacacgggt ttaaagttta tttttatggt 2358 

ttagtcattg cagagttctt attttggggg gagggaaagg gggctagtcc ccttcttttg 2418 

gccctccgcc cccgcaggct tctgtgtgct gctaactgta tttattgtga tgccttggtc 2478 

agggcccctc tacccacttc tcccagtcag ttgtggcccc agcccctctc cctgtgctgt 2538 

gtggagtgga caccctgacc cccgaagcgg ggagggcege tgtggccttc gtcacagccg 2598 

cgcagtgccc atggaggcgc tgctgccacc ttcctctccc. aagttctttc tccatccctc 2658 

tcctcttccc gccgcgccgc tagcccgcct cggtgtctat gcaaggccgc ttcgccattg 2718 

eggtattett tgeggtatte ttgtccccgt cccccagaag gctcgcctct ccccgtggac 2778 

cctgttaatc ccaataaaat tctgagcaag ttcaaaaaaa aaaaaaaaaa aa 2830 

<210> 6 

<211> 693 

<212> PRT 

<213> NMJD22117 SE20-4 

<400> 6 

Met Asp Arg Pro Asp Glu Gly Pro Pro Ala Lys Thr Arg Arg Leu Ser 
1 5 10 15 



Ser Ser Glu Ser Pro Gin Arg Asp Pro Pro Pro Pro Pro Pro Pro Pro 
20 25 30 



Pro Leu Leu Arg Leu Pro Leu Pro Pro Pro Gin Gin Arg Pro Arg Leu 
35 40 45 



Gin Glu Glu Thr Glu Ala Ala Gin Val Leu Ala Asp Met Arg Gly Val 
50 55 60 



Gly Leu Gly Pro Ala Leu Pro Pro Pro Pro Pro Tyr. Val lie Leu Glu 
65 70 75 80 



Glu Gly Gly lie Arg Ala Tyr Phe Thr Leu Gly Ala Glu Cys Pro Gly 
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Trp Asp Ser Thr lie Glu Ser Gly Tyr Gly Glu Ala Pro Pro Pro Thr 
100 105 110 

Glu Ser Leu Glu Ala Leu Pro Thr Pro Glu Ala Ser Gly Gly Ser Leu 
115 120 125 

Glu IleAsp Phe Gin Val Val Gin Ser Ser Ser Phe Gly Gly Glu Gly 
130 135 140 

Ala Leu Glu Thr Cys Ser Ala Val Gly Trp Ala Pro Gin Arg Leu Val 
i45 150 155 160 

Asp Pro Lys Ser Lys Glu Glu Ala He He He Val Glu Asp Glu Asp 
165 170 175 

Glu Asp Glu Arg Glu Ser Met Arg Ser Ser Arg Arg Arg Arg Arg Arg 
180 185 190 

Arg Arg Arg Lys Gin Arg Lys Val Lys Arg Glu Ser Arg Glu Arg Asn 
195 200 205 

Ala Glu Arg Met Glu Ser He Leu Gin Ala Leu Glu Asp He Gin Leu 
210 215 220 

Asp Leu Glu Ala Val Asn He Lys Ala Gly Lys Ala Phe Leu Arg Leu 
225 230 . 235 240 

Lys Arg Lys Phe He Gin Met Arg Arg Pro Phe Leu Glu Arg Arg Asp 
245 250 255 

Leu He He Gin His He Pro Gly Phe Trp Val Lys Ala Phe Leu Asn 
260 265 270 

His Pro Arg He Ser He Leu He Asn Arg Arg Asp Glu Asp He Phe 
275 280 285 

Arg Tyr Leu Thr Asn Leu Gin Val Gin Asp Leu Arg His He Ser Met 
290 295 300 

Gly Tyr Lys Met Lys Leu Tyr Phe Gin Thr Asn Pro Tyr Phe Thr Asn 
305 310 315 320 

Met Val He Val Lys Glu Phe Gin Arg Asn Arg Ser Gly Arg Leu Val 
325 330 335 

Ser His Ser Thr Pro He Arg Trp His Arg Gly Gin Glu Pro Gin Ala 
340 345 350 



Arg Arg His Gly Asn Gin Asp Ala Ser His Ser Phe Phe Ser Trp Phe 
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Ser Asn His Ser Leu Pro Glu Ala Asp Arg lie Ala Glu lie lie Lys 
370 375 " 380 

Asn Asp Leu Trp Val Asn Pro Leu Arg Tyr Tyr Leu Arg Glu Arg Gly 
385 390 395 400 

Ser Arg lie Lys Arg Lys Lys Gin Glu Met Lys Lys Arg Lys Thr Arg 
405 410 415 

Gly Arg Cys Glu Val Val He Met Glu Asp Ala Pro Asp Tyr Tyr Ala 
420 425 * 430 

Val Glu Asp He Phe Ser Glu lie Ser Asp He Asp Glu Thr He His 
435 440 445 

Asp He Lys He Ser Asp Phe Met Glu Thr Thr Asp Tyr Phe Glu Thr 
450 455 460 

Thr Asp Asn Glu He Thr Asp He Asn Glu Asn He Cys Asp Ser Glu 
465 470 475 " 480 

Asn Pro Asp His Asn Glu Val Pro Asn Asn Glu Thr Thr Asp Asn Asn 
485 490 495 

Glu Ser Ala Asp Asp His Glu Thr Thr Asp Asn Asn Glu Ser Ala Asp 
500 505 510 

Asp Asn Asn Glu Asn Pro Glu Asp Asn Asn Lys Asn Thr Asp Asp Asn 
515 520 525 

Glu Glu Asn Pro Asn Asn Asn Glu Asn Thr Tyr Gly Asn Asn Phe Phe 
530 535 . 540 

Lys Gly Gly Phe Trp Gly Ser His Gly Asn Asn Gin Asp Ser Ser Asp 
545 550 555 560 

Ser Asp Asn Glu Ala Asp Glu Ala Ser Asp Asp Glu Asp Asn Asp Gly 
565 570 575 

Asn Glu Gly Asp Asn Glu Gly Ser Asp Asp Asp Gly Asn Glu Gly Asp 
580 585 590 

Asn Glu Gly Ser Asp Asp Asp Asp Arg Asp He Glu Tyr Tyr Glu Lys 
595 600 605 

Val He Glu Asp Phe Asp Lys Asp Gin Ala Asp Tyr Glu Asp Val He 
610 615 620 



Glu lie lie Ser Asp Glu Ser Val Glu Glu Glu Gly He Glu Glu Gly 
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625 * 630 635 640 



He Gin Gin Asp Glu Asp He Tyr Glu Glu Gly Asn Tyr Glu Glu Glu 
645 650 655 



Gly Ser Glu Asp Val Trp Glu Glu Gly Glu Asp Ser Asp Asp Ser Asp 
660 665 670 



Leu Glu Asp Val Leu Gin Val Pro Asn Gly Trp Ala Asn Pro Gly Lys 
675 680 * 685 



Arg Gly Lys Thr Gly 
690 



<210> 7 

<211> 2632 

<212> DNA 

<213> NMJ)16463 HSPC195 
<220> 

<221> CDS 

<222> (931) (1611) 

<223> 

<400> 7 

ggcggctgag cctgagcggg gatgtagagg cggcggcagc agaggcggca ctggcggcaa 60 

gagcagacgc ccgagccgag cgagaagagc ggcagagcct tatcccctga agccgggccc 120 

cgcgtcccag ccctgcccag cccgcgccca gccatgcgcg ccgcctgctg agtccgggcg 180 

ccgcacgctg agccctccgc- ccgcgagccg cgctcagctc gggggtgatt agttgctttt 240 

tgttgttttt taatttgggc cgcggggagg gggaggaggg gcaggtgctg caggctcccc 300 

cccctccccg cctcgggcca gccgcggcgg cgcgactcgg gptccggacc cgggcactgc 3 60 

tggcggctgg agcggagcgc accgcggcgg tggt'gcccag agcggagcgc agctccctgc 420 

cccgcccctc cccctcggcc tcgcggcgac ggcggcggtg gcggcttgga cgactcggag 480 

agccgagtga agacatttcc acctggacac ctgaccatgt gcctgccctg agcagcgagg 540 

cccaccaggc atctctgttg tgggcagcag ggccaggtcc tggtctgtgg accctcggca 600 

gttggcaggc tccctctgca gtggggtctg ggcctcggcc ccaccatgtc gagcctcggc 660 

ggtggctccc aggatgccgg cggcagtagc agcagcagca ccaatggcag cggtggcagt 720 

ggcagcagtg gcccaaaggc aggagcagca gacaagagtg cagtggtggc tgccgccgca 780 

ccagcctcag tggcagatga cacaccaccc cccgagcgtc ggaacaagag cggtatcatc 840 
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agtgagcccc tcaacaagag cctgcgccgc tcccgcccgc tctcccacta ctcttctttt 900 

ggcagcagtg gtggtagtgg cggtggcagc atg atg ggc gga gag tct get gac 954 

Met Met Gly Gly Glu Ser Ala Asp 
1 5 

aag gec act gcg get gca gee get. gee tec ctg ttg gee aat ggg cat 1002 
Lys Ala Thr Ala Ala Ala Ala Ala Ala Ser Leu Leu Ala Asn Gly His 
10 15 20 

gac ctg gcg gcg gee atg gcg gtg gac aaa age aac cct ace tea aag 1050 
Asp Leu Ala Ala Ala Met Ala Val Asp Lys Ser Asn Pro Thr Ser Lys 
25 30 " 35 40 

cac aaa agt ggt get gtg gee age ctg ctg age aag gca gag egg gee 109.8 
His Lys Ser Gly Ala Val Ala Ser Leu Leu Ser Lys Ala Glu Arg Ala 
45 50 55 

acg gag ctg gca gee gag gga cag ctg acg ctg cag cag ttt gcg cag 1146 
Thr Glu Leu Ala Ala Glu Gly Gin Leu Thr Leu Gin Gin Phe Ala Gin 
60 65 .70 

tec aca gag atg ctg aag cgc gtg gtg cag. gag cat etc ccg ctg atg 1194 
Ser Thr Glu Met Leu Lys Arg Val Val Gin Glu His Leu Pro Leu Met 
75 80 85 

age gag gcg ggt get ggc ctg cct gac atg gag get gtg gca ggt gee 1242 
Ser Glu Ala Gly Ala Gly Leu Pro Asp Met Glu Ala Val Ala Gly Ala 
90 95 100 

gaa gee etc aat ggc cag tec gac ttc ccc tac ctg ggc get ttc ccc 1290 
Glu Ala Leu Asn Gly Gin Ser Asp Phe Pro Tyr Leu Gly Ala Phe Pro 
105 110 115 120 

ate aac cca ggc etc ttc att atg ace ccg gca ggt gtg ttc ctg gee 1338 
lie Asn Pro Gly Leu Phe lie Met Thr Pro Ala Gly Val Phe Leu Ala 
125 130 135 

gag age gcg ctg cac atg gcg ggc ctg get gag tac ccc atg cag gga 1386 
Glu Ser Ala Leu His Met Ala Gly Leu Ala Glu Tyr Pro Met Gin Gly 
140 145 150 

gag ctg gee tct gee ate age tec ggc aag aag aag egg aaa cgc tgc 1434 
Glu Leu Ala Ser Ala He Ser Ser Gly Lys Lys Lys Arg Lys Arg Cys 
155 160 165 

ggc atg tgc gcg ccc tgc egg egg cgc ate aac tgc gag cag tgc age 1482 
Gly Met Cys Ala Pro Cys Arg Arg Arg lie Asn Cys Glu Gin Cys Ser 
170 175 180 

agt tgt agg aat cga aag act ggc cat cag att tgc aaa ttc aga aaa 1530 
Ser Cys Arg Asn Arg Lys Thr Gly His Gin He Cys Lys Phe Arg Lys 
185 190 195 200 

tgt gag gaa etc aaa aag aag cct tec get get ctg gag aag gtg atg 1578 
Cys Glu Glu Leu Lys Lys Lys Pro Ser Ala Ala Leu Glu Lys Val Met 
205 210 215 

ctt ccg acg gga gee gee ttc egg tgg ttt cag tgaeggegge ggaacccaaa 1631 
Leu Pro Thr Gly Ala Ala Phe Arg Trp Phe Gin 
220 225 

gctgccctct ccgtgcaatg tcactgctcg tgtggtctcc agcaagggat tegggegaag 1691 

acaaaeggat gcacccgtct ttagaaccaa aaatattctc tcacagattt cattcctgtt 1751 

tttatatata tattttttgt tgtcgtttta acatctccac gtccctagca taaaaagaaa 1811 
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aagaaaiaaaa 


tttaaactgc tttttcggaa 


gaacaacaac aaaaaagagg taaagacgaa 


1871 


tctataaagt 


accgagactt cctgggcaaa 


gaatggacaa tcagtttcct tcctgtgtcg • 


1931 
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2171 


I* wwu^ y v»*y ^-M 


i^. ciy dy v- uyuL. ciy^ciyyyciy^ 


L^aL-ayaayy y yciy yy oy v-a uuayyLLayc 




tcracrctcfca.c 

^ w ^— ^ C-l 


r , r*ar*acft*ccc cracract*crcrcra 


trpfrrarrf caar*acft"cra'f* r"H"nrr3aas3 


2291 


aaaatgaaag 


ttctgttcgt ttatccattg 


cgatctgggg agccccatct cgatatttcc 


2351 


aatcctggct 


acttttctta gagaaaataa 


gtcctttttt tctggccttg ctaatggcaa 


2411 


cagaagaaag 


ggcttctttg cgtggtcccc 


tgctggtggg ggtgggtccc cagggggccc 


2471 


cctgcggcct 


gggcccccct gcccacggcc 


agcttcctgc tgatgaacat gctgtttgta 


2531 


ttgttttagg 


aaaccaggct gttttgtgaa 


taaaacgaat gcatgtttgt gtcacgaaaa 


2591 


aaaaaaaaaa 


aaaaaaaaaa aaaaaaaaaa 


aaaaaaaaaa a 


2632 



<210> 8 

<211> 227 
<212> PRT 

<213> NM_0164 63 HSPC195 
<400> 8 

Met Met Gly Gly Glu Ser Ala Asp Lys Ala Thr Ala Ala Ala Ala Ala 
15 10 15 

Ala Ser Leu Leu Ala Asn Gly His Asp Leu Ala Ala Ala Met Ala Val 
20 25 30 

Asp Lys Ser Asn Pro Thr Ser Lys His Lys Ser Gly Ala Val Ala Ser 
35 40 45 

Leu Leu Ser Lys Ala Glu Arg Ala Thr Glu Leu Ala Ala Glu Gly Gin 
50 55 60 

Leu Thr Leu Gin Gin Phe Ala Gin Ser Thr Glu Met Leu Lys Arg Val 
65 70 75 80 

Val Gin Glu His Leu Pro Leu Met Ser Glu Ala Gly Ala Gly Leu Pro 
85 90 95 

Asp Met Glu Ala Val Ala Gly Ala Glu Ala Leu Asn Gly Gin Ser Asp 
100 105 110 
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Phe Pro Tyr Leu Gly Ala Phe Pro He Asn Pro Gly Leu Phe He Met 
115 120 125 



Thr Pro Ala Gly Val Phe Leu Ala Glu Ser Ala Leu His Met Ala Gly 
130 135 140 



Leu Ala Glu Tyr Pro Met Gin Gly Glu Leu Ala Ser Ala He Ser Ser 
145 150 ' 155 160 



Gly Lys Lys Lys Arg Lys Arg Cys Gly Met Cys Ala Pro Cys Arg Arg 
165 170 175 



Arg He Asn Cys Glu Gin Cys Ser Ser Cys Arg Asn Arg Lys Thr Gly 
180 185 190 



His Gin He Cys Lys Phe Arg Lys Cys Glu Glu Leu Lys Lys Lys Pro 
195 200 205 



Ser Ala Ala Leu Glu Lys Val Met Leu Pro Thr Gly Ala Ala Phe Arg 
210 .215 220 



Trp Phe Gin 
225 



<210> 9 

<211> 2140 

<212> DNA 

<213> NM 017697 FLJ20171 



<220> 

<221> CDS 

<222> (58) .. (1131) 

<223> 



<400> 9 

gaattcaaga aatgttgccc tggttcacct gatattgaca aactggacgt tgccaca 57 

atg aca gag tat tta aat ttt gag aag agt agt tea gtc tct cga tat 105 
Met Thr Glu Tyr Leu Asn Phe Glu Lys Ser Ser Ser Val Ser Arg Tyr 
15 10 15 

gga gec tct caa gtt gaa gat atg ggg aat ata att tta gca atg att 153 
Gly Ala Ser Gin Val Glu Asp Met Gly Asn He He Leu Ala Met He 
20 25 30 

tea gag cct tat aat cac agg ttt tea gat cca gag aga gtg aat tac 201 
Ser Glu Pro Tyr Asn His Arg Phe Ser Asp Pro Glu Arg Val Asn Tyr 
35 40 45 
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aag ttt gaa agt gga act tgc age aag atg gaa ctt att gat gat aac 249 
Lys Phe Glu Ser Gly Thr Cys Ser Lys Met Glu Leu He Asp Asp Asn 
50 55 60- 

acc gta gtc agg gca cga ggt tta cca tgg cag tct tea gat caa gat 297 
Thr Val Val Arg Ala Arg Gly Leu Pro Trp Gin Ser Ser Asp Gin Asp 
65 70 75 80 

att gca aga ttc ttc aaa gga etc aat att gee aag gga ggt gca gca 345 
He Ala Arg Phe Phe Lys Gly Leu Asn He Ala Lys Gly Gly Ala Ala 

.85 -90 95 

ctt tgt ctg aat get cag ggt cga agg aac gga gaa get ctg gtt agg 393 
Leu Cys Leu Asn Ala Gin Gly Arg Arg Asn Gly Glu Ala Leu Val Arg 
100 105 110 

ttt gta agt gag gag cac cga gac eta gca eta cag agg cac aaa cat 441 
Phe Val Ser Glu Glu His Arg Asp Leu Ala Leu Gin Arg His Lys His 
115 120 125 

cac atg ggg acc egg tat att gag gtt tac aaa gca aca ggt gaa gat 489 
His Met Gly Thr Arg Tyr He Glu Val Tyr Lys Ala Thr Gly Glu Asp 
130 135 140 

ttc ctt aaa att get ggt ggt act tec aat gag gta gee cag ttt etc 537 
Phe Leu Lys He Ala Gly Gly Thr Ser Asn Glu Val Ala Gin Phe Leu 
145 150 155 160 

tec aag gaa aat caa gtc att gtt cgc atg egg ggg etc cct ttc acg 585 
Ser Lys Glu Asn Gin Val He Val Arg Met Arg Gly Leu Pro Phe Thr 
165 170 175 

gee aca get gaa gaa gtg gtg gee ttc ttt gga cag cat tgc cct att 633 
Ala Thr Ala Glu Glu Val Val Ala Phe Phe Gly Gin His Cys Pro He 
180 185 190 

act ggg gga aag gaa ggc ate etc ttt gtc acc tac cca gat ggt agg 681 
Thr Gly Gly Lys Glu Gly He Leu Phe Val Thr Tyr Pro Asp Gly Arg 
195 200 205 

cca aca ggg gac get ttt gtc etc ttt gee tgt gag gaa tat gca cag 729 
Pro Thr Gly Asp Ala Phe Val Leu Phe Ala Cys Glu Glu Tyr Ala Gin 
210 215 220 

aat gcg ttg agg aag cat aaa gac ttg ttg ggt aaa aga tac att gaa 777 
Asn Ala Leu Arg Lys His Lys Asp Leu Leu Gly Lys Arg Tyr He Glu 
225 230 235 240 

etc ttc agg age aca gca get gaa gtt cag cag gtg ctg aat cga ttc 825 
Leu Phe Arg Ser Thr Ala Ala Glu Val Gin Gin Val Leu Asn Arg Phe 
245 250 255 

tec teg gec cct etc att cca ctt cca acc cct ccc att att cca gta 873 
Ser Ser Ala Pro Leu He Pro Leu Pro Thr Pro Pro He He Pro Val 
260 265 270 

eta cct cag caa ttt gtg ccc cct aca aat gtt aga gac tgt ata cgc 921 
Leu Pro Gin Gin Phe Val Pro Pro Thr Asn Val Arg Asp Cys He Arg 
275 280 285 

ctt cga ggt ctt ccc tat gca gee aca att gag gac ate ctg gat ttc 969 
Leu Arg Gly Leu Pro Tyr Ala Ala Thr He Glu Asp lie Leu Asp Phe 
290 295 300 

ctg ggg gag ttc gee aca gat att cgt act cat ggg gtt cac atg gtt 1017 
Leu Gly Glu Phe Ala Thr Asp He Arg Thr His Gly Val His Met Val 
305 310 315 320 
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ttg aat cac cag ggc cgc cca tea gga gat gec ttt ate cag atg aag 1065 
Leu Asn His Gin Gly Arg Pro Ser Gly Asp Ala Phe He Gin Met Lys 
325 330 335 

tct gcg gac aga gca ttt atg get gca cag aag tgt cat aaa.aaa aaa 1113 
Ser Ala Asp Arg Ala Phe Met Ala Ala Gin Lys Cys His Lys Lys Lys 
340 345- 350 

cat gaa gga cag ata tgt tgaagtcttt cagtgttcag ctgaggagat 1161 
His Glu Gly Gin He Cys 
355 



gaactttgtg 


ttaatggggg 


gcactttaaa tcgaaatggc ttatccccac cgccatgtaa 


1221 


gttaccatgc 


ctgtctcctc 


cctcctacac atttccagct cctgctgcag ttattcctac 


1281 


agaagctgee 


atttaccagc 


cctctgtgat tttgaatcca cgagcactgc agccctccac 


1341 


agegtactae 


ccagcaggca 


ctcagctctt catgaactac acagegtact atcccagccc 


1401 


cccaggttcg 


cctaatagtc 


ttggctactt ccctacagct gctaatctta gcggtgtccc 


1461 


tccacagcct 


ggcacggtgg 


teagaatgea gggcctggcc tacaatactg gagttaagga 


1521 


aattcttaac 


ttcttccaag 


gttaccagtg tttgaaagat gtatggtgat cttgaaacct 


1581 


ccagacacaa 


gaaaacttct 


agcaaattca ggggaagttt gtctacactc aggctgeagt 


164.1 


attttcagca 


aacttgattg 


gaeaaaeggg cctgtgcctt atcttttggt ggagtgaaaa 


1701 


agtttgagct 


agtgaagcca 


aategtaact tacagcaagc ageatgeage atacctggct 


1761 


etttgetgat 


tgcaaatagg 


catttaaaat gtgaatttgg aatcagatgt ctccattact 


1821 


tccagttaaa 


gtggcatcat 


aggtgtttcc taagttttaa gtcttggata aaaactccac 


1881 


cagtgtctac 


catctccacc 


atgaactctg ttaaggaagc ttcatttttg tatattcccg 


1941 


ctcttttctc 


ttcatttccc 


tgtcttctgc ataatcatgc ettcttgeta agtaattcaa ■ 


2001 


gcataagatc 


ttggaataat 


aaaatcacaa tcttaggaga aagaataaaa ttgttatttt 


2061 


cccagtctct 


tggecatgat 


gatatcttat gattaaaaac aaattaaatt ttaaaacacc 


2121 


tgaaaaaaaa 


aaaaaaaaa 




2140 



<210> 10 

<211> 358 

<212> PRT 

<213> NMJ>17697 FLJ20171 

<400> 10 

Met Thr Glu Tyr Leu Asn Phe Glu Lys Ser Ser Ser Val Ser Arg Tyr 
1 5 10 . 15 



Gly Ala Ser Gin Val Glu Asp Met Gly Asn He He Leu Ala Met He 
20 25 30 
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Ser Glu Pro Tyr Asn His Arg Phe Ser Asp Pro Glu Arg Val Asn Tyr 
35 40 45 



Lys Phe Glu Ser Gly Thr Cys Ser Lys Met Glu Leu lie Asp Asp Asn 
50 55 . 60 



Thr Val Val Arg Ala Arg Gly Leu Pro Trp Gin Ser Ser Asp Gin Asp 
65 70 75 80 



lie Ala Arg Phe Phe Lys Gly Leu Asn lie Ala Lys Gly Gly Ala Ala 
85 90 > 95 



Leu Cys Leu Asn Ala Gin Gly Arg Arg Asn Gly Glu Ala Leu Val Arg 
100 . 105 110 



Phe Val Ser Glu Glu His Arg Asp Leu Ala Leu Gin Arg His Lys His 
115 120 125 



His Met Gly Thr Arg Tyr lie Glu Val Tyr Lys Ala Thr Gly Glu Asp 
130 135 140 



Phe Leu Lys He Ala Gly Gly Thr Ser Asn Glu Val Ala Gin Phe Leu 
145 150 155 160 



Ser Lys Glu Asn Gin Val He Val Arg Met Arg Gly Leu Pro Phe Thr 
165 170 175 



Ala Thr Ala Glu Glu Val Val Ala Phe Phe Gly Gin His Cys Pro He 
180 185 * 190 



Thr Gly Gly Lys Glu Gly He Leu Phe Val Thr Tyr Pro Asp Gly Arg 
195 200 205 



Pro Thr Gly Asp Ala Phe Val Leu Phe Ala Cys Glu Glu Tyr Ala Gin 
210 215 220 



Asn Ala Leu Arg Lys His Lys Asp Leu Leu Gly Lys Arg Tyr He Glu 
225 230 235 240 



Leu Phe Arg Ser Thr Ala Ala Glu Val Gin Gin Val Leu Asn Arg Phe 
245 250 255 



Ser Ser Ala Pro Leu He Pro Leu Pro Thr Pro Pro He He Pro Val 
260 265 270 



Leu Pro Gin Gin Phe Val Pro Pro Thr Asn Val Arg Asp Cys He Arg 
275 280 285 



Leu Arg Gly Leu Pro Tyr Ala Ala Thr He Glu Asp He Leu Asp Phe 
290 295 300 
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Leu Gly Glu Phe Ala Thr Asp He Arg Thr His Gly Val His Met Val 
305 . 310 315 320 



Leu Asn His Gin Gly Arg Pro Ser Gly Asp Ala Phe He Gin Met Lys 
325 330 335 

Ser Ala Asp Arg Ala Phe Met Ala Ala Gin Lys Cys His Lys Lys Lys 
340 345 350 

His Glu Gly Gin He Cys 
355 



<210> 11 

<211> 2808 

<212> DNA 

<213> NM 052886 MAL2 



<220> 

<221> CDS 

<222> {80).. (607) 

<223> 



<400> 11 

ggcggcggcg gcaggagccc gggaggcgga ggcgggaggc ggcggcggcg cgcggagacg 60 

cagcagcggc agcggcagc atg teg gee ggc gga gcg tea gtc ccg ccg ccc 112 
Met Ser Ala Gly Gly Ala Ser Val Pro Pro Pro 
15 10 

ccg aac ccc gec gtg tec ttc ccg ccg ccc egg gtc ace ctg ccc gee 160 
Pro Asn Pro Ala Val Ser Phe Pro Pro Pro Arg Val Thr Leu Pro Ala 
15 • 20 25 

ggc ccc gac ate ctg egg ace tac teg ggc gee ttc gtc tgc ctg gag 208 
Gly Pro Asp He Leu Arg Thr Tyr Ser Gly Ala Phe Val Cys Leu Glu 
30 35 40 

att ctg ttc ggg ggt ctt gtc tgg att ttg gtt gec tec tec aat gtt 256 
He Leu Phe Gly Gly Leu Val Trp He Leu Val Ala Ser Ser Asn Val 
45 * 50 55 

cct eta cct eta eta caa gga tgg gtc atg ttt gtg tec gtg aca gcg 304 
Pro Leu Pro Leu Leu Gin Gly Trp Val Met Phe Val Ser Val Thr Ala 
60 65 70 75 

ttt ttc ttt teg etc etc ttt ctg ggc atg ttc etc tct ggc atg gtg 352 
Phe Phe Phe Ser Leu Leu Phe Leu Gly Met Phe Leu Ser Gly Met Val 
80 85 90 

get caa att gat get aac tgg aac ttc ctg gat ttt gee tac cat ttt 400 
Ala Gin lie Asp Ala Asn Trp Asn Phe Leu Asp Phe Ala Tyr His Phe 
95 100 105 
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aca gta ttt gtc ttc tat ttt gga gcc ttt tta ttg gaa gca gca gcc 448 
Thr Val Phe Val Phe Tyr Phe Gly Ala Phe Leu Leu Glu Ala Ala Ala 
110 115 % 120 

aca tec ctg cat gat ttg cat tgc aat.aca acc ata acc ggg cag cca 496 
Thr Ser Leu His Asp Leu His Cys Asn Thr Thr lie Thr Gly Gin Pro 
125 130 135 

etc ctg agt gat aac cag tat aac ata aac gta gca gcc tea att ttt 544 
Leu Leu Ser Asp Asn Gin Tyr Asn lie Asn Val Ala Ala Ser He Phe 
140 145 150 155 

gcc ttt atg acg aca get tgt tat ggt tgc agt ttg ggt ctg get tta 592 
Ala Phe Met Thr Thr Ala Cys Tyr Gly Cys Ser Leu Gly Leu Ala Leu 
160 165 170 

cga aga tgg cga ccg taacactcct tagaaactgg cagtegtatg ttagtttcac 647 
Arg Arg Trp Arg Pro 
175 



ttgtctactt 


tatatgtctg 


atcaatttgg 


ataccatttt 


gtccagatgc 


aaaaacattc 


707 


caaaagtaat 


gtgtttagta 


gagagagact 


ctaagctcaa 


gttctggttt 


atttcatgga 


767 


tggaatgtta 


attttattat 


gatattaaag 


aaatggcctt 


ttattttaca 


tctctcccct 


827 


ttttcccttt 


ccccctttat 


tttcctcctt 


ttctttctga 


aagtttcctt 


ttatgtccat 


887 


aaaatacaaa 


tatattgttc 


ataaaaaatt 


agtatccctt 


ttgtttggtt 


gctgagtcac 


947 


ctgaacctta 


attttaattg 


gtaattacag 


cccctaaaaa 


aaacacattt 


caaataggct 


1007 


tcccactaaa 


ctctatattt 


tagtgtaaac 


caggaattgg 


cacacttttt 


ttagaatggg 


1067 


ccagatggta 


aatatttatg 


cttcacggtc 


catacagtct 


ctgtcacaac 


tattcagttc 


1127 


tgctagtata 


gcgtgaaagc 


agctatacac 


aatacagaaa 


tgaatgagtg 


tggttatgtt 


1187 


ctaataaaac 


ttatttataa 


aaacaagggg 


aggctgggtt 


tagcctgtgg 


gccatagttt 


1247 


gtcaaccact 


ggtgtaaaac 


cttagttata 


tatgatctgc 


attttcttga 


actgatcatt 


1307 


gaaaacttat 


aaacctaaca 


gaaaagecac 


ataatattta 


gtgtcattat 


gcaataatca 


1367 


cattgecttt 


gtgttaatag 


tcaaatactt 


acctttggag. 


aatacttacc 


tttggaggaa 


1427 


tgtataaaat 


ttctcaggca 


gagtcctgga 


tataggaaaa 


agtaatttat 


gaagtaaact 


1487 


teagttgett 


aatcaaacta 


atgatagtct 


aacaactgag 


caagatcctc 


atctgagagt 


1547 


gcttaaaatg 


ggatccccag 


agaccattaa 


ccaatactgg 


aactggtatc 


tagctactga 


1607 


tgtcttactt 


tgagtttatt 


tatgettcag 


aatacagttg 


tttgccctgt 


gcatgaatat 


1667 


acccatattt 


gtgtgtggat 


atgtgaagct 


tttccaaata 


gagctctcag 


aagaattaag 


1727 


tttttacttc 


taattatttt 


gcattacttt 


gagttaaatt 


tgaatagagt 


attaaatata 


1787 


aagttgtaga 


ttcttatgtg 


tttttgtatt 


agcccagaca 


tctgtaatgt 


ttttgcactg 


1847 


gtgacagaca 


aaatctgttt 


taaaatcata 


tccagcacaa 


aaactatttc 


tggctgaata 


1907 


gcacagaaaa 


gtattttaac 


ctacctgtag 


agatcctcgt 


catggaaagg 


tgccaaactg 


1967 


ttttgaatgg 


aaggacaagt 


aagagtgagg 


ccacagttcc 


caccacacga 


gggcttttgt 


2027 


attgttctac 


tttttcagee 


ctttactttc 


tggctgaagc 


atccccttgg 


agtgccatgt 


2087 
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ataagttggg 


ctattagagt 


tcatggaaca 


tagaacaacc atgaatgagt 


ggcatgatcc 


2147 


gtgcttaatg 


atcaagtgtt 


acttatctaa 


taatcctcta' gaaagaaccc 


tgttagatct 


2207 


tggtttgtga 


taaaaatata 


aagacagaag 


acatgaggaa aaacaaaagg 


tttgaggaaa 


2267 


tcaggcatat 


gactttatac 


ttaacatcag 


atcttttcta taatatccta 


ctactttggt 


2327 


tttcctagct 


ccataccaca 


cacctaaacc 


tgtattatga attacatatt 


acaaagtcat 


2387 


aaatgtgcca 


tatggatata 


cagtacattc 


tagttggaat cgtttactct 


gctagaattt 


2447 


aggtgtgaga 


ttttttgttt 


cccaggtata 


gcaggcttat gtttggtggc 


attaaattgg 


2507 


tttctttaaa 


atgctttggt 


ggcacttttg 


taaacagatt gcttctagat 


tgttacaaac 


2567 


caagcctaag 


acacatctgt 


gaatacttag 


atttgtagct taatcacatt 


ctagacttgt 


2627 


gagttgaatg 


acaaagcagt 


tgaacaaaaa 


ttatggcatt taagaattta 


acatgtctta 


2687 


gctgtaaaaa 


tgagaaagtg 


ttggttggtt 


ttaaaatctg gtaactccat 


gatgaaaaga 


2747 


aatttatttt 


atacgtgtta 


tgtctctaat 


aaagtattca tttgataaaa 


aaaaaaaaaa 


2807 


a 










2808 



<210> 12 

<211> 176 

<212> PRT 

<213> NMJ>52886 MAL2 

<400> 12 

Met Ser Ala Gly Gly Ala Ser Val Pro Pro Pro Pro Asn Pro Ala Val 
1 5 10 15 

Ser Phe Pro Pro Pro Arg Val Thr Leu Pro Ala Gly Pro Asp He Leu 
20 25 30 

Arg Thr Tyr Ser Gly Ala Phe Val Cys Leu Glu He Leu Phe Gly Gly 
35 40 45 

Leu Val Trp He Leu Val Ala Ser Ser Asn Val Pro Leu Pro Leu Leu 
50 55 60 

Gin Gly Trp Val Met Phe Val Ser Val Thr Ala Phe Phe Phe Ser Leu 
65 70 75 80 

Leu Phe Leu Gly Met Phe Leu Ser Gly Met Val Ala Gin He Asp Ala 
85 90 95 

Asn Trp Asn Phe Leu Asp Phe Ala Tyr His Phe Thr Val Phe Val Phe 
100 105 110 

Tyr Phe Gly Ala Phe Leu Leu Glu Ala Ala Ala Thr Ser Leu His Asp 
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115 120 125 



Leu His Cys Asn Thr Thr lie Thr Gly Gin Pro Leu Leu Ser Asp Asn 
130 135 140 



Gin Tyr Asn lie Asn Val Ala Ala Ser He Phe Ala Phe Met Thr Thr 
145 150 155 160 



Ala Cys Tyr Gly Cys Ser Leu Gly Leu Ala Leu Arg Arg Trp Arg Pro 
165 170 175 



<210> 13 

<211> 4171 

<212> DNA 

<213> NM_015238 KIAA0869, KIBRA 

<220> 

<221> CDS 

<222> (67) . . (3405) 

<223> 



<400> 13 

ctggagccgc tgagcccccg ctgcggccgg gagctgcatg ggggagcgcc ggcagcgctt 60 

gggaag atg ccc egg ccg gag ctg ccc ctg ccg gag ggc tgg gag gag 108 
Met Pro Arg Pro Glu Leu Pro Leu Pro Glu Gly Trp Glu Glu 
1 5 .10 

gcg cgc gac ttc gac ggc aag gtc tac tac ata gac cac acg aac cgc 156 
Ala Arg Asp Phe Asp Gly Lys Val Tyr Tyr He Asp His Thr Asn Arg 
15 20 25 30 

acc acc age tgg ate gac ccg egg gac agg tac acc aaa ccg etc acc 204 
Thr Thr Ser Trp He Asp Pro Arg Asp Arg Tyr Thr Lys Pro Leu Thr 
35 40 45 



ttt get gac tgc att agt gat gag ttg ccg eta gga tgg gaa gag gca 252 
Phe Ala Asp Cys He Ser Asp Glu Leu Pro Leu Gly Trp Glu Glu Ala 
50 55 60 

tat gac cca cag gtt gga gat tac ttc ata gac cac aac acc aaa acc 300 
Tyr Asp Pro Gin Val Gly Asp Tyr Phe He Asp His Asn Thr Lys Thr 
65 * 70 75 

act cag att gag gat cct cga gta caa tgg egg egg gag cag gaa cat 348 
Thr Gin He Glu Asp Pro Arg Val Gin Trp Arg Arg Glu Gin Glu His 
80 85 90 

atg ctg aag gat tac ctg gtg gtg gec cag gag get ctg agt gca caa 396 
Met Leu Lys Asp Tyr Leu Val Val Ala Gin Glu Ala Leu Ser Ala Gin 
95 100 105 110 

aag gag ate tac cag gtg aag cag cag cgc ctg gag ctt gca cag cag 444 
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Lys Glu He Tyr Gin Val Lys Gin Gin Arg Leu Glu Leu Ala Gin Gin 
115 .120 125 

gag tac cag caa ctg cat gcc gtc tgg gag cat aag ctg ggc tec cag 4 92 

Glu Tyr Gin Gin Leu His Ala Val Trp Glu His Lys Leu Gly Ser .Gin 
130 135. 140 

gtc age ttg gtc tct ggt tea tea tec age tec aag tat gac cct gag 540 

Val Ser Leu Val Ser Gly Ser Ser Ser Ser Ser Lys Tyr Asp Pro Glu 

145 150 155 

ate ctg aaa get gaa att gcc act gca aaa tec egg gtc aac aag ctg 588 

He Leu Lys Ala Glu He Ala Thr Ala Lys Ser Arg Val Asn Lys Leu 
160 165 170 



aag aga gag atg gtt cac etc cag cac gag ctg cag ttc aaa gag cgt 636 
Lys Arg Glu Met Val His Leu Gin His Glu Leu Gin Phe Lys Glu Arg 
175 180 185 190 

ggc ttt cag ace ctg aag aaa ate gat, aag aaa atg tct gat get cag 684 
Gly Phe Gin Thr Leu Lys Lys He Asp Lys Lys Met Ser Asp Ala Gin 
195 200 205 

ggc age tac aaa ctg gat gaa get cag get gtc ttg aga gaa aca aaa 732 
Gly Ser Tyr Lys Leu Asp Glu Ala Gin Ala Val Leu Arg Glu Thr Lys 
210 215 220 

gcc ate aaa aag get att acc tgt ggg gaa aag gaa aag caa gat etc . 780 
Ala He Lys Lys Ala He Thr Cys Gly Glu Lys Glu Lys Gin Asp Leu 
225 230 235 

att aag age ctt gcc atg ttg aag gac ggc ttc cgc act gac agg ggg 828 
He Lys Ser Leu Ala Met Leu Lys Asp Gly Phe Arg Thr Asp Arg Gly 
240 245 " 250 

tct cac tea gac ctg tgg tec age age age tct ctg gag agt teg agt 876 
Ser His Ser Asp Leu Trp Ser Ser Ser Ser Ser Leu Glu Ser Ser Ser 
255 260 265 270 

ttc ccg eta ccg aaa cag tac ctg gat gtg age tec cag aca gac ate 924 
Phe Pro Leu Pro Lys Gin Tyr Leu Asp Val Ser Ser Gin Thr Asp He 
275 280 . 285 

teg gga age ttc ggc ate aac age aac aat cag ttg gca gag aag gtc 972 
Ser Gly Ser Phe Gly He Asn Ser Asn Asn Gin Leu Ala Glu Lys Val 
290 295 300 



aga ttg cgc ctt cga tat gaa gag get aag aga agg ate gcc aac ctg 1020 
Arg Leu Arg Leu Arg Tyr Glu Glu Ala Lys Arg Arg He Ala Asn Leu 
305 310 315 

aag ate cag ctg gcc aag ctt gac agt gag gcc tgg cct ggg gtg ctg 1068 
Lys He Gin Leu Ala Lys Leu Asp Ser Glu Ala Trp Pro Gly Val Leu 
320 325 330 

gac tea gag agg gac egg ctg ate ctt ate aac gag aag gag gag ctg 1116 
Asp Ser Glu Arg Asp Arg Leu lie Leu He Asn Glu Lys Glu Glu Leu 
335 340 345 350 

ctg aag gag atg cgc ttc ate age ccc cgc aag tgg acc cag ggg gag 1164 
Leu Lys Glu Met Arg Phe He Ser Pro Arg Lys Trp Thr Gin Gly Glu 
355 360 • 365 

gtg gag cag ctg gag atg gcc egg aag egg ctg gaa aag gac ctg cag 1212 
Val Glu Gin Leu Glu Met Ala Arg Lys Arg Leu Glu Lys Asp Leu Gin 
370 375 380 
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gca gcc egg gac acc cag age aag gcg ctg acg gag agg tta aag tta 
Ala Ala Arg Asp Thr Gin Ser Lys Ala Leu Thr Glu Arg Leu Lys Leu 

385 390 ■ • 395 

aac agt aag agg aac cag ctt gtg aga gaa ctg gag gaa gcc acc egg 
Asn Ser Lys Arg Asn Gin Leu Val Arg Glu Leu Glu Glu Ala Thr Arg 
400 405 410 

cag gtg gca act ctg cac tec cag ctg aaa agt etc tea age age atg 
Gin Val Ala Thr Leu His Ser Gin Leu. Lys. Ser Leu Ser Ser Ser Met 
415 420 ~ 425 430 



cag tec ctg tec tea ggc age age ccc gga tec etc acg tec age egg 
Gin Ser Leu Ser Ser Gly Ser Ser Pro Gly Ser Leu Thr Ser Ser Arg 
435 440 445 

ggc tec ctg gtt gca tec age ctg gac tec tec act tea gcc age ttc 
Gly Ser Leu Val Ala Ser Ser Leu Asp Ser Ser Thr Ser Ala Ser Phe 
450 455 . . 460 

act gac etc tac tat gac ccc ttt gag cag ctg. gac tea gag ctg cag 
Thr Asp Leu Tyr Tyr Asp Pro Phe Glu Gin Leu Asp Ser Glu Leu Gin 
465 470 475 

age aag gtg gag ttc ctg etc ctg gag ggg gcc acc ggc ttc egg ccc 
Ser Lys Val Glu Phe Leu Leu Leu Glu Gly Ala Thr Gly Phe Arg Pro 
480 485 490 . 

tea ggc tgc ate acc acc ate cac gag gat gag gtg gcc aag acc cag 
Ser Gly Cys He Thr Thr He His Glu Asp Glu Val Ala Lys Thr Gin 
495 500 505 510 

aag gca gag gga ggt ggc cgc ctg cag get ctg cgt tec ctg tct ggc 
Lys Ala Glu Gly Gly Gly Arg Leu Gin Ala Leu Arg Ser Leu Ser Gly 
515 520 525 

acc cca aag tec atg acc tec eta tec' cca cgt tec tct etc tec tec 
Thr Pro Lys Ser Met Thr Ser Leu Ser Pro Arg Ser Ser Leu Ser Ser 
530 535 54 0 

ccc tec cca ccc tgt tec cct etc atg get gac ccc etc ctg get ggt 
Pro Ser Pro Pro Cys Ser Pro Leu Met Ala Asp Pro Leu Leu Ala Gly 
545 550 555 



gat gcc ttc etc aac tec ttg gag ttt gaa gac ccg gag ctg agt gcc 
Asp Ala Phe Leu Asn Ser Leu Glu Phe Glu Asp Pro Glu Leu Ser Ala 
560 565 570 

act ctt tgt gaa ctg age ctt ggt aac age gcc cag gaa aga tac egg 
Thr Leu Cys Glu Leu Ser Leu Gly Asn Ser Ala Gin Glu Arg Tyr Arg 
575 580 585 590 

ctg gag gaa cca gga acg gag ggc aag cag ctg ggc caa get gtg aat 
Leu Glu Glu Pro Gly Thr Glu Gly Lys Gin Leu Gly Gin Ala Val Asn 
595 600 605 

acg gcc cag ggg tgt ggc ctg aaa gtg gcc tgt gtc tea gcc gcc gta 
Thr Ala Gin Gly Cys Gly Leu Lys Val Ala Cys Val Ser Ala Ala Val 
610 615 620 

teg gac gag tea gtg get gga gac agt ggt gtg tac gag get tec gtg 
Ser Asp Glu Ser Val Ala Gly Asp Ser Gly Val Tyr Glu Ala Ser Val 
625 630 635 

cag aga ctg ggt get tea gaa get get gca ttt gac agt gac gaa teg 



1260 
1308 
1356 

1404 

1452 

1500 

1548 

1596 

1644 

1692 

1740 

1788 
1836 
1884 
1932 
1980 
2028 
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Gin Arg Leu Gly Ala Ser Glu Ala Ala Ala Phe Asp Ser Asp Glu Ser 
640 645 650 

gaa gca gtg ggt gcg acc cga att cag att gcc ctg aag tat gat gag 2076 

Glu Ala Val Gly Ala Thr Arg lie Gin lie Ala Leu Lys Tyr Asp Glu 

655 660 ' 665 670 

aag aat aag caa ttt gca ata tta ate ate cag ctg: agt aac ctt tct 2124 

Lys Asn Lys Gin Phe Ala lie Leu He He Gin Leu Ser Asn Leu Ser 
675 680 685 



get ctg ttg cag caa caa gac cag aaa gtg aat ate cgc gtg get gtc 2172 
Ala Leu Leu Gin Gin Gin Asp Gin Lys Val Asn He Arg Val Ala Val 
690 695 700 

ctt cct tgc tct gaa age aca acc tgc ctg ttc egg acc egg cct ctg 2220 
Leu Pro Cys Ser Glu Ser Thr Thr Cys Leu Phe Arg Thr Arg Pro Leu 
705 710 715 

gac gcc tea gac act eta gtg ttc aat gag gtg ttc tgg gta tec atg 2268 
Asp Ala Ser Asp Thr Leu Val Phe Asn Glu Val Phe Trp Val Ser Met 
720 725 730 

tec tat cca gcc ctt cac cag aag acc tta aga gtc gat gtc tgt acc 2316 
Ser Tyr Pro Ala Leu His Gin Lys Thr Leu Arg Val Asp Val Cys Thr 
735 740 745 750 

acc gac agg age cat ctg gaa gag tgc ctg gga ggc gcc cag ate age 2364 
Thr Asp Arg Ser His Leu Glu Glu Cys Leu Gly Gly Ala Gin He Ser 
755 760 765 

ctg gcg gag gtc tgc egg tct ggg gag agg teg act cgc tgg tac aac 2412 
Leu Ala Glu Val Cys Arg Ser Gly Glu Arg Ser Thr Arg Trp Tyr Asn 
770 775 780 

ctt etc age tac aaa tac ttg aag aag cag age agg gag etc aag cca 24 60 
Leu Leu Ser Tyr Lys Tyr Leu Lys Lys Gin Ser Arg Glu Leu Lys Pro 
785 790 795 

gtg gga gtt atg gcc cct gcc tea ggg cct gcc age acg gac get gtg 2508 
Val Gly Val Met Ala Pro Ala Ser Gly Pro Ala Ser Thr Asp Ala Val 
800 805 810 



tct get ctg ttg gaa cag aca gca gtg gag ctg gag aag agg cag gag 2556 
Ser Ala Leu Leu Glu Gin Thr Ala Val Glu Leu Glu Lys Arg Gin Glu 
815 820 825 830 

ggc agg age age aca cag aca ctg gaa gac age tgg agg tat gag gag 2604 
Gly Arg Ser Ser Thr Gin Thr Leu Glu Asp Ser Trp Arg Tyr Glu Glu 
835 840 " 845 

acc agt gag aat gag gca gta gcc gag gaa gag gag gag gag gtg gag 2652 
Thr Ser Glu Asn Glu Ala Val Ala Glu Glu Glu Glu Glu Glu Val Glu 
850 855 860 

gag gag gag gga gaa gag gat gtt ttc acc gag aaa gcc tea cct gat 2700 
Glu Glu Glu Gly Glu Glu Asp Val Phe Thr Glu Lys Ala Ser Pro Asp 
865 870 875 

atg gat ggg tac cca gca tta aag gtg gac aaa gag acc aac acg gag 2748 
Met Asp Gly Tyr Pro Ala Leu Lys Val Asp Lys Glu Thr Asn Thr Glu 
880 885 890 

acc ccg gcc cca tec ccc aca gtg gtg cga cct aag gac egg aga gtg 2796 
Thr Pro Ala Pro Ser Pro Thr Val Val Arg Pro Lys Asp Arg Arg Val 
895 900 905 910 
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ggc acc ccg tec cag ggg cca ttt ctt cga ggg age acc ate ate cgc 2844 
Gly Thr Pro Ser Gin Gly Pro Phe Leu Arg Gly Ser Thr lie lie Arg 
915 . 920 925 

tct aag acc ttc tec cca gga ccc cag age cag tac gtg tgc egg ctg 2892 
Ser Lys Thr Phe Ser Pro Gly Pro Gin Ser Gin Tyr Val Cys Arg Leu 
930 935 940 



aat egg agt gat agt gac age tec act ctg tec aaa aag cca cct ttt 2940 
Asn Arg Ser Asp Ser Asp Ser Ser Thr Leu Ser Lys Lys Pro Pro Phe 
945 950 '• 955 

' gtt cga aac tec ctg gag cga cgc age gtc egg atg aag egg cct tec 2988 
Val Arg Asn Ser Leu Glu Arg Arg Ser Val Arg Met Lys Arg Pro Ser 
960 965 970 

teg gtc aag teg ctg cgc tec gag cgt ctg ate cgt acc teg ctg gac 3036 
Ser Val Lys Ser Leu Arg Ser Glu Arg Leu lie Arg Thr Ser Leu Asp 
975 "* 980 985 990 

ctg gag tta gac ctg cag gcg aca aga acc tgg cac age caa ctg acc 3084 
Leu Glu Leu Asp Leu Gin Ala Thr Arg Thr Trp His Ser Gin Leu Thr 
995 1000 - 1005 

cag gag ate teg gtg ctg aag gag etc aag gag cag ctg gaa caa 3129 
Gin Glu He Ser Val Leu Lys Glu Leu Lys Glu Gin Leu Glu Gin 
1010 1015 1020 

gee aag age cac ggg gag aag gag ctg cca cag tgg ttg cgt gag 3174 
Ala Lys Ser His Gly Glu Lys Glu Leu Pro Gin Trp Leu Arg Glu 
1025 1030 1035 

gac gag cgt ttc cgc ctg ctg ctg agg atg ctg gag aag egg cag 3219 
Asp Glu Arg Phe Arg Leu Leu Leu Arg Met Leu Glu Lys Arg Gin 
1040 1045 1050 

atg gac cga gcg gag cac aag ggt gag ctt cag aca gac aag atg 3264 
Met Asp Arg Ala Glu His Lys Gly Glu Leu Gin Thr Asp Lys Met 
1055 1060 1065 



atg agg gca get gee aag gat gtg cac agg etc cga ggc cag age 3309 
Met Arg Ala Ala Ala Lys Asp Val His Arg Leu Arg Gly Gin Ser 
1070 ~ 1075 . 1080 

tgt aag gaa ccc cca gaa gtt cag tct ttc agg gag aag atg gca 3354 
Cys Lys Glu Pro Pro Glu Val Gin Ser Phe Arg Glu Lys Met Ala 
1085 1090 1095 

ttt ttc acc egg cct egg atg aat ate cca get etc tct gca gat 3399 
Phe Phe Thr Arg Pro Arg Met Asn He Pro Ala Leu Ser Ala Asp 
1100 1105 • 1110 

gac gtc taatcgccag aaaagtattt cctttgttcc actgaccagg ctgtgaacat 3455 
Asp Val 

tgactgtggc taaagttatt tatgtggtgt tatatgaagg tactgagtca caagtcctct 3515 

agtgctcttg ttggtttgaa gatgaaccga ctttttagtt tgggtcctac tgttgttatt 3575 

aaaaacagaa caaaaacaaa acacacacac acacaaaaac agaaacaaaa aaaaccagca 3635 

ttaaaataat aagattgtat agtttgtata tttaggagtg tatttttggg aaagaaaatt 3695 

taaatgaact aaagcagtat tgagttgctg ctcttcttaa aatcgtttag attttttttg 3755 
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gtttgtacag ctccaccttt tagaggtctt actgcaataa- gaagtaatgc ctgggggacg 3815 

gtaatcctaa taggacgtcc cgcacttgtc acagtacagc taatttttcc tagttaacat 3875 

attttgtaca atattaaaaa aatgcacaga aaccattggg ggggattcag aggtgcatcc 3935 

acggatcttc ttgagctgtg acgtgttttt atgtggctgc ccaacgtgga gcgggcagtg 3995 

tgataggctg ggtgggctaa gcagcctagt ctatgtgggt gacaggccac gctggtctca 4055 

gatgcccagt gaagccacta acatgagtga ggggagggct gtggggaact ccattcagtt 4115 

ttatctccat caataaagtg gcctttcaaa aagaaaaaaa aaaaaaaaaa aaaaaa 4171 

<210> 14 

<211> 1113 

<212> PRT 

<213> NM_015238 KIAA0869, KIBRA 

<400> 14 

Met Pro Arg Pro Glu Leu Pro Leu Pro Glu Gly Trp Glu Glu Ala Arg 
1 5 10 " 15 

Asp Phe Asp Gly Lys Val Tyr Tyr lie Asp His Thr Asn Arg Thr Thr 
20 "* 25 30 

Ser Trp He Asp Pro Arg Asp Arg Tyr Thr Lys Pro Leu Thr Phe Ala 
35 40 45 

Asp Cys He Ser Asp Glu Leu Pro Leu Gly Trp Glu Glu Ala Tyr Asp 
50 55 60 

Pro Gin Val Gly Asp Tyr Phe He Asp His Asn Thr Lys Thr Thr Gin 
65 70 75 " 80 

He Glu Asp Pro Arg Val Gin Trp Arg Arg Glu Gin Glu His Met Leu 
85 90 95 

Lys Asp Tyr Leu Val Val Ala Gin Glu Ala Leu Ser Ala Gin Lys Glu 
100 105 110 

He Tyr Gin Val Lys Gin Gin Arg Leu Glu Leu Ala Gin Gin Glu Tyr 
115 120 125 

Gin Gin Leu His Ala Val Trp Glu His Lys Leu Gly Ser Gin Val Ser 
130 135 140 



Leu Val Ser Gly Ser Ser Ser Ser Ser Lys Tyr Asp Pro Glu He Leu 
145 150 155 160 
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Lys Ala Glu He Ala Thr Ala Lys Ser. Arg Val Asn Lys Leu Lys Arg 
165 170 175 



Glu Met Val His Leu Gin His Glu Leu Gin Phe Lys Glu Arg Gly Phe 
180 185 190 



Gin- Thr Leu Lys Lys He Asp Lys Lys Met Ser Asp Ala Gin Gly Ser 
195 200 .205 



Tyr Lys Leu Asp Glu Ala Gin Ala Val Leu Arg Glu Thr Lys Ala He 
210 215 220 



Lys Lys Ala He Thr Cys Gly Glu Lys Glu Lys Gin Asp Leu He Lys 
225 230 235 240 



Ser Leu Ala Met Leu Lys Asp Gly Phe Arg Thr Asp Arg Gly Ser His 
245 250 . 255 



Ser Asp Leu Trp Ser Ser Ser Ser Ser Leu Glu Ser Ser Ser Phe Pro 
260 265 270 



Leu Pro Lys Gin Tyr Leu Asp Val Ser Ser Gin Thr Asp He Ser Gly 
275 280 285 



Ser Phe Gly He Asn Ser Asn Asn Gin Leu Ala Glu Lys Val Arg Leu 
290 295 300 



Arg Leu Arg Tyr Glu Glu Ala Lys Arg Arg He Ala Asn Leu Lys He 
305 310 315 320 



Gin Leu Ala Lys Leu Asp Ser Glu Ala Trp Pro Gly Val Leu Asp Ser 
325 . 330 335 



Glu Arg Asp Arg Leu He Leu He Asn Glu Lys Glu Glu Leu Leu Lys 
340 345 350 



Glu Met Arg Phe He Ser Pro Arg Lys Trp Thr Gin Gly Glu Val Glu 
355 360 365 



Gin Leu Glu Met Ala Arg Lys Arg Leu Glu Lys Asp Leu Gin Ala Ala 
370 375 380 



Arg Asp Thr Gin Ser Lys Ala Leu Thr Glu Arg Leu Lys Leu Asn Ser 
385 390 395 400 



Lys Arg Asn Gin Leu Val Arg Glu Leu Glu Glu Ala Thr Arg Gin Val 
405 410 415 



Ala Thr Leu His Ser Gin Leu Lys Ser Leu Ser Ser Ser Met Gin Ser 
420 425 430 
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Leu Ser Ser Gly Ser Ser Pro Gly Ser Leu Thr Ser Ser Arg Gly Ser 
435 440 445 



Leu Val Ala Ser Ser Leu Asp Ser Ser Thr Ser Ala Ser Phe Thr Asp 
450 455 460 



Leu Tyr Tyr Asp Pro Phe Glu Gin Leu Asp Ser Glu Leu Gin Ser Lys 
465 470 475 480 



Val Glu Phe Leu Leu Leu Glu Gly Ala Thr Gly Phe Arg Pro Ser Gly 
485 490 495 



Cys He Thr Thr He His Glu Asp Glu Val Ala Lys Thr Gin Lys Ala 
500 505 510 



Glu Gly Gly Gly Arg Leu Gin Ala Leu Arg Ser Leu Ser Gly Thr Pro 
515 520 525 



Lys Ser Met Thr Ser Leu Ser Pro Arg Ser Ser Leu Ser Ser Pro Ser 
530 535 ' 540 



Pro Pro Cys Ser Pro Leu Met Ala Asp Pro Leu Leu Ala Gly Asp Ala 
545 550 555 560 



Phe Leu Asn Ser Leu Glu Phe Glu Asp Pro Glu Leu Ser Ala Thr Leu 
565 570 575 



Cys Glu Leu Ser Leu Gly Asn Ser Ala Gin Glu Arg Tyr Arg Leu Glu 
580 585 590 



Glu Pro Gly Thr Glu Gly Lys Gin Leu Gly Gin Ala Val Asn Thr Ala 
595 600 605 



Gin Gly Cys Gly Leu Lys Val Ala Cys Val Ser Ala Ala Val Ser Asp 
610 615 620 



Glu Ser Val Ala Gly Asp Ser Gly Val Tyr Glu Ala Ser Val Gin Arg 
625 630 635 640 



Leu Gly Ala Ser Glu Ala Ala Ala Phe Asp Ser Asp Glu Ser Glu Ala 
645 650 655 



Val Gly Ala Thr Arg He Gin He Ala Leu Lys Tyr Asp Glu Lys Asn 
660 665 670 



Lys Gin Phe Ala He Leu He He Gin Leu Ser Asn Leu Ser Ala Leu 
675 680 685 



Leu Gin Gin Gin Asp Gin Lys Val Asn He Arg Val Ala Val Leu Pro 
690 695 700 
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Cys Ser Glu Ser Thr Thr Cys Leu Phe Arg Thr Arg Pro Leu Asp Ala 
705 710 " 715 " 720 



Ser Asp Thr Leu Val Phe Asn Glu Val Phe Trp Val Ser Met Ser Tyr 
725 .730 735 



Pro Ala Leu His Gin Lys Thr Leu Arg Val Asp Val Cys Thr Thr Asp 
740 745 750 



Arg Ser His Leu Glu Glu Cys Leu Gly Gly Ala Gin He Ser Leu Ala 
755 760 765 



Glu Val Cys Arg Ser Gly Glu Arg Ser Thr Arg Trp Tyr Asn Leu Leu 
770 775 780 



Ser Tyr Lys Tyr Leu Lys Lys Gin Ser Arg Glu Leu Lys Pro Val Gly 
785 790 795 800 



Val Met Ala Pro Ala Ser Gly Pro Ala Ser Thr Asp Ala Val Ser Ala 
805 810 815 



Leu Leu Glu Gin Thr Ala Val Glu Leu Glu Lys Arg Gin Glu Gly Arg 
820 825 830 



Ser Ser Thr Gin Thr Leu Glu Asp Ser Trp Arg Tyr Glu Glu Thr Ser 
835 840 845 



Glu Asn Glu Ala Val Ala Glu Glu Glu Glu Glu Glu Val Glu Glu Glu 
850 855 860 



Glu Gly Glu Glu Asp Val Phe Thr Glu Lys Ala Ser Pro Asp Met Asp 
865 " 870 875 880 



Gly Tyr Pro Ala Leu Lys Val Asp Lys Glu Thr Asn Thr Glu Thr Pro 
885 890 895 



Ala Pro Ser Pro Thr Val Val Arg Pro Lys Asp Arg Arg Val Gly Thr 
900 905 910 



Pro Ser Gin Gly Pro Phe Leu Arg Gly Ser Thr He He Arg Ser Lys 
915 920 925 



Thr Phe Ser Pro Gly Pro Gin Ser Gin Tyr Val Cys Arg Leu Asn Arg 
930 935 940 



Ser Asp Ser Asp Ser Ser Thr Leu Ser Lys Lys Pro Pro Phe Val Arg 
945 950 955 960 



Asn Ser Leu Glu Arg Arg Ser Val Arg Met Lys Arg Pro Ser Ser Val 
965 970 975 
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Lys Ser Leu Arg Ser Glu Arg Leu He Arg Thr Ser Leu Asp Leu Glu 
980 985 990 



Leu Asp Leu Gin Ala Thr Arg Thr Trp His Ser Gin Leu Thr Gin Glu 
995 1000 1005 



Lys Glu Gin Leu Glu Gin Ala Lys 
1020 



Pro Gin Trp Leu Arg Glu Asp Glu 
1035 



Met Leu Glu Lys Arg Gin Met Asp 
1050 



Leu Gin Thr Asp Lys Met Met Arg 
1065 



Arg Leu Arg Gly Gin Ser Cys Lys 
1080 



Phe Arg Glu Lys Met Ala Phe Phe 
1095 



Pro Ala Leu Ser Ala Asp Asp Val 
1110 



<210> 15 

<211> 1294 

<212> DNA 

<213> NM_001306 CLDN3, claudin 3 



.lie Ser Val Leu Lys Glu Leu 
1010 1015 



Ser His Gly Glu Lys Glu Leu 
1025 1030 



Arg Phe Arg Leu Leu Leu Arg 
1040 1045 



Arg Ala Glu His Lys Gly Glu 
1055 1060 



Ala Ala Ala Lys Asp Val His 
1070 1075 



Glu Pro Pro Glu Val Gin Ser 
1085 1090 



Thr Arg Pro Arg Met Asn He 
1100 1105 



<220> 

<221> CDS 

<222> (198) . . (857) 

<223> 



<400> 15 

caaagccaca ggcaggtgca ggcgcagccg cggcgagagc gtatggagcc gagccgttag 60 

cgcgcgccgt cggtgagtca gtccgtccgt ccgtccgtcc gtcggggcgc cgcagctccc 120 

gccaggccca gcggccccgg cccctcgtct ccccgcaccc ggagccaccc ggtggagcgg 180 

gccttgccgc ggcagcc atg tec atg ggc ctg gag ate acg ggc acc gcg 230 
Met Ser Met Gly Leu Glu lie Thr Gly Thr Ala 
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1 -5 10 

ctg gcc gtg ctg ggc tgg ctg ggc acc ate gtg tgc tgc gcg ttg ccc 278 
Leu Ala Val Leu Gly Trp Leu Gly Thr lie Val Cys Cys Ala Leu Pro 
15 20 ~ 25 

atg tgg cgc gtg teg gcc ttc ate ggc age aac ate ate acg teg cag 326 
Met Trp Arg Val Ser Ala Phe lie Gly Ser Asn lie lie Thr Ser Gin 
30 35 40 

aac ate tgg gag ggc ctg tgg atg aac tgc gtg gtg cag age acc ggc 374 
Asn He Trp Glu Gly Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly 
45 50 55 

cag atg cag tgc aag gtg tac gac teg ctg ctg gca ctg cca cag gac 422 
Gin Met Gin Cys Lys Val Tyr Asp Ser Leu Leu Ala Leu Pro Gin Asp 
60 65 70 75 

ctt cag gcg gcc cgc gcc etc ate gtg gtg gcc ate ctg ctg gcc gcc 470 
Leu Gin Ala Ala Arg Ala Leu He Val Val Ala He Leu Leu Ala Ala 
80 85 90 

ttc ggg ctg eta gtg gcg ctg gtg ggc gcc cag tgc acc aac tgc gtg 518 
Phe Gly Leu Leu Val Ala Leu Val Gly Ala Gin Cys Thr Asn Cys Val 
95 100 105 

cag gac gac acg gcc aag gcc aag ate acc ate gtg gca ggc gtg ctg 566 
Gin Asp Asp Thr Ala Lys Ala Lys He Thr He Val Ala Gly Val Leu 
110 115 120 

ttc ctt etc gcc gcc ctg etc acc etc gtg ccg gtg tec tgg teg gcc 614 
Phe Leu Leu Ala Ala Leu Leu Thr Leu Val Pro Val Ser Trp Ser Ala 
125 130 135 

aac acc att ate egg gac ttc tac aac ccc gtg gtg ccc gag gcg cag 662 
Asn Thr He He Arg Asp Phe Tyr Asn Pro .Val Val Pro Glu Ala Gin 
140 145 ; 150 155 

aag cgc gag atg ggc gcg ggc ctg tac gtg ggc tgg gcg gcc gcg gcg 710 
Lys Arg Glu Met Gly Ala Gly Leu Tyr Val Gly Trp Ala Ala Ala Ala 
160 165 170 

ctg cag ctg ctg ggg ggc gcg ctg etc tgc tgc teg tgt ccc cca cgc 758 
Leu Gin Leu Leu Gly Gly Ala Leu Leu Cys Cys Ser Cys Pro Pro Arg 
175 180 185 

gag aag aag tac acg gcc acc aag gtc gtc tac tec gcg ccg cgc tec 806 
Glu Lys Lys Tyr Thr Ala Thr Lys Val Val Tyr Ser Ala Pro Arg Ser 
190 195 200 

acc ggc ccg gga gcc age ctg ggc aca ggc tac gac cgc aag gac tac 854 
Thr Gly Pro Gly Ala Ser Leu Gly Thr Gly Tyr Asp Arg Lys Asp Tyr 
205 210 215 

gtc taagggacag aegcagggag accccaccac caccaccacc accaacacca 907 

Val 

220 

ccaccaccac cgcgagctgg agcgcgcacc aggccatcca gcgtgcagcc ttgcctcgga 967 

ggccagccca cccccagaag ccaggaagcc cccgcgctgg actggggcag cttccccagc 1027 

agccacggct ttgcgggccg ggcagtcgac ttcggggccc agggaccaac ctgcatggac 1087 

tgtgaaacct cacccttctg gagcaegggg cctgggtgac cgccaatact tgaccacccc 1147 

gtcgagcccc atcgggccgc tgcccccatg ctcgcgctgg gcagggaccg gcagccctgg 1207 
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aaggggcact tgatattttt caataaaagc ctttcgtttt gcaaaaaaaa aaaaaaaaaa 1267 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 1294 

<210> 16 
<211> 220 
<212> PRT 

<213> NMJ)01306 CLDN3 , claudin 3 
<400> 16 

Met Ser Met Gly Leu Glu He Thr Gly Thr Ala Leu Ala Val Leu Gly 
15 10 15 



Trp Leu Gly Thr He Val Cys Cys Ala Leu Pro Met Trp Arg Val Ser 
20 25 30 



Ala Phe He Gly Ser Asn He He Thr Ser Gin Asn He Trp Glu Gly 
35 40 45 



Leu Trp Met Asn Cys Val Val Gin Ser Thr Gly Gin Met Gin Cys Lys 
50 55 60 



Val Tyr Asp Ser Leu Leu Ala Leu Pro Gin Asp Leu Gin Ala Ala Arg 
65 70 75 80 



Ala Leu He Val Val Ala lie Leu Leu Ala Ala Phe Gly Leu Leu Val 
85 90 95 



Ala Leu Val Gly Ala Gin Cys Thr Asn Cys Val Gin Asp Asp Thr Ala 
100 105 110 



Lys Ala Lys He Thr He Val Ala Gly Val Leu Phe Leu Leu Ala Ala 
115 120 125 



Leu Leu Thr Leu Val Pro Val Ser Trp Ser Ala Asn Thr He He Arg 
130 135 140 



Asp Phe Tyr Asn Pro Val Val Pro Glu Ala Gin Lys Arg Glu Met Gly 
145 150 155 160 



Ala Gly Leu Tyr Val Gly Trp Ala Ala Ala Ala Leu Gin Leu Leu Gly 
165 170 175 



Gly Ala Leu Leu Cys Cys Ser Cys Pro Pro Arg Glu Lys Lys Tyr Thr 
180 185. 190 



Ala Thr Lys Val Val Tyr Ser Ala Pro Arg Ser Thr Gly Pro Gly Ala 
195 200 205 
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Ser Leu Gly Thr Gly Tyr Asp Arg Lys Asp Tyr Val 
210 215 220 



<210> 17 

<211> 1853 

<212> DNA 

<213> NM 022454 SOX17 



<220> 

<221> CDS 

<222> (205) (1446) 

<223> 



<400> 17 

gcagtgtcac taggccggct gggggccctg ggtacgctgt agaccagacc gcgacaggcc 60 

agaacacggg cggcggcttc gggccgggag acccgcgcag ccctcggggc atctcagtgc 120 

ctcactcccc accccctccc ccgggtcggg ggaggcggcg cgtccggcgg agggttgagg 180 

ggagcggggc aggcctggag cgcc atg age age ccg gat gcg gga tac gee 231 

Met Ser Ser Pro Asp Ala Gly Tyr Ala 
1 5 

agt gac gac cag age cag acc cag age gcg ctg ccc gcg gtg atg gee 279 
Ser Asp Asp Gin Ser Gin Thr Gin Ser Ala Leu Pro Ala Val Met Ala 
10 15 20 25 

ggg ctg ggc ccc tgc ccc tgg gec gag teg ctg age ccc ate ggg gac 327 
Gly Leu Gly Pro Cys Pro Trp Ala Glu Ser Leu Ser Pro lie Gly Asp 
30 35 40 

atg aag gtg aag ggc gag gcg ccg gcg aac age gga gca ccg gee ggg 375 
Met Lys Val Lys Gly Glu Ala Pro Ala Asn Ser Gly Ala Pro Ala Gly 
45 50 "55 

gee gcg ggc cga gee aag ggc gag tec cgt ate egg egg ccg atg aac 423 
Ala Ala Gly Arg Ala Lys Gly Glu Ser Arg lie Arg Arg Pro Met Asn 
60 65 70 

get ttc atg gtg tgg get aag gac gag cgc aag egg ctg gcg cag cag 471 

Ala Phe Met Val Trp Ala Lys Asp Glu Arg Lys Arg Leu Ala Gin Gin 
75 80 85 

aat cca gac ctg cac aac gec gag ttg age aag atg ctg ggc aag teg 519 
Asn Pro Asp Leu His Asn Ala Glu Leu Ser Lys Met Leu Gly Lys Ser 
90 95 100 . 105 

tgg aag gcg ctg acg ctg gcg gag aag egg ccc ttc gtg gag gag gca 567 
Trp Lys Ala Leu Thr Leu Ala Glu Lys Arg Pro Phe Val Glu Glu Ala 
110 115 120 



gag egg ctg cgc gtg cag cac atg cag gac cac ccc aac tac aag tac 
Glu Arg Leu Arg Val Gin His Met Gin Asp His Pro Asn Tyr Lys Tyr 
125 130 135 



615 
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egg ccg egg egg cgc aag cag gtg aag egg ctg aag egg gtg gag ggc 663 
Arg Pro Arg Arg Arg Lys Gin Val Lys Arg Leu Lys Arg Val Glu Gly 
140 145 ; 150 

ggc ttc ctg cac ggc ctg get gag ccg cag gcg gee gcg ctg ggc ccc 711 
Gly Phe Leu His Gly Leu Ala Glu Pro Gin Ala Ala Ala Leu Gly Pro 
155 160 165 

gag ggc ggc cgc gtg gee atg gac ggc ctg ggc etc cag ttc ccc gag 759 
Glu Gly Gly Arg Val Ala Met Asp Gly Leu Gly Leu Gin Phe Pro Glu 
170 175 180 185 

cag ggc ttc ccc gee ggc ccg ccg ctg ctg cct ccg cac atg ggc ggc 807 
Gin Gly Phe Pro Ala Gly Pro Pro Leu Leu Pro Pro His Met Gly Gly 
190 195 ■ 200 

cac tac cgc gac tgc cag agt ctg ggc gcg cct ccg etc gac ggc tac 855 

His Tyr Arg Asp Cys Gin Ser Leu Gly Ala Pro Pro Leu Asp Gly Tyr 
205 . 210 215 

ccg ttg ccc acg ccc gac acg tec ccg ctg gac ggc "gtg gac ccc gac 903 
Pro Leu Pro Thr Pro Asp Thr Ser Pro Leu Asp Gly Val Asp Pro Asp 
220 225 230 

ccg get ttc ttc gee gee ccg atg ccc ggg gac tgc ccg gcg gee ggc 951 
Pro Ala Phe Phe Ala Ala Pro Met Pro Gly Asp Cys Pro Ala Ala Gly 
235 240 245 

ace tac age tac gcg cag gtc teg gac tac get ggc ccc ccg gag cct 999 
Thr Tyr Ser Tyr Ala Gin Val Ser Asp Tyr Ala Gly Pro Pro Glu Pro 
250 255 260 265 

ccc gee ggt ccc atg cac ccc cga etc ggc cca gag ccc gcg ggt ccc 1047 
Pro Ala Gly Pro Met His Pro Arg Leu Gly Pro Glu Pro Ala Gly Pro 
270 275 280 

teg att ccg ggc etc ctg gcg cca ccc age gee ctt cac gtg tac tac 1095 
Ser lie Pro Gly Leu Leu Ala Pro Pro Ser Ala Leu His Val Tyr Tyr 
285 290 295 

ggc gcg atg ggc teg ccc ggg gcg ggc ggc ggg cgc ggc ttc cag atg 1143 
Gly Ala Met Gly Ser Pro Gly Ala Gly Gly Gly Arg Gly Phe Gin Met 
300 305 310 

cag ccg caa cac cag cac cag cac cag cac cag cac cac ccc ccg ggc 1191 
Gin Pro Gin His Gin His Gin His Gin His Gin His His Pro Pro Gly 
315 320 325 

ccc gga cag ccg teg ccc cct ccg gag gca ctg ccc tgc egg gac ggc 1239 

Pro Gly Gin Pro Ser Pro Pro Pro Glu Ala Leu Pro Cys Arg Asp Gly 
330 335 340 345 

acg gac ccc agt cag ccc gee gag etc etc ggg gag gtg gac cgc acg 1287 
Thr Asp Pro Ser Gin Pro Ala Glu Leu Leu Gly Glu Val Asp Arg Thr 
350 355 * 360 

gaa ttt gaa cag tat ctg cac ttc gtg tgc aag cct gag atg ggc etc 1335 
Glu Phe Glu Gin Tyr Leu His Phe Val Cys Lys Pro Glu Met Gly Leu 
365 370 375 

ccc tac cag ggg cat gac tec ggt gtg aat etc ccc gac age cac ggg . 1383 
Pro Tyr Gin Gly His Asp Ser Gly Val Asn Leu Pro Asp Ser His Gly 
380 385 390 

gee att tec teg gtg gtg tec gac gee age tec gcg gta tat tac tgc 1431 
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Ala He Ser Ser Val Val Ser Asp Ala Ser Ser Ala Val Tyr Tyr Cys 
395 400 405 

aac tat cct gac gtg tgacaggtcc ctgatccgcc ccagcctgca ggccagaagc 1486 

Asn Tyr Pro Asp Val 

410 

agtgttacac acttcctgga ggagctaagg aaatcctcag actcctgggt ttttgttgtt 1546 

gctgttgttg ttttttaaaa ggtgtgttgg catataattt atggtaattt attttgtctg 1606 

ccacttgaac agtttggggg ggtgaggttt catttaaaat ttgttcagag atttgtttcc 1666 

catagttgga ttgtcaaaac cctatttcca agttcaagtt aactagcttt gaatgtgtcc 1726 

caaaacagct tcctccattt cctgaaagtt tattgatcaa agaaatgttg tcctgggtgt 1786 

gttttttcaa tcttctaaaa aataaaatct ggaatcctga aaaa'aaaaaa aaaaaaaaaa 1846 

aaaaaaa 1853 

<210> 18 
<211> 414 
<212> PRT 

<213> NM_022454 SOX17 . 
<400> 18 

Met Ser Ser Pro Asp Ala Gly Tyr Ala Ser Asp Asp Gin Ser Gin Thr 
15 10 15 



Gin Ser Ala Leu Pro Ala Val Met Ala Gly Leu Gly Pro Cys Pro Trp 
20 25 30 



Ala Glu Ser Leu Ser Pro He Gly Asp Met Lys Val Lys Gly Glu Ala 
35 40 . 45 



Pro Ala Asn Ser Gly Ala Pro Ala Gly Ala Ala Gly Arg Ala Lys Gly 
50 55 60 



Glu Ser Arg He Arg Arg Pro Met Asn Ala Phe Met Val Trp Ala Lys 
65 70 75 80 



Asp Glu Arg Lys Arg Leu Ala Gin Gin Asn Pro Asp Leu His Asn Ala 
85 90 95 



Glu Leu Ser Lys Met Leu Gly Lys Ser Trp Lys Ala Leu Thr Leu Ala 
100 105 110 



Glu Lys Arg Pro Phe Val Glu Glu Ala Glu Arg Leu Arg Val Gin His 
115 120 125 



Met Gin Asp His Pro Asn Tyr Lys Tyr Arg Pro Arg Arg Arg Lys Gin 
130 135 140 
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Val Lys Arg Leu Lys Arg Val Glu Gly Gly Phe Leu His Gly Leu Ala 
145 " 150 155 160 



Glu Pro Gin Ala Ala Ala Leu Gly Pro Glu Gly Gly Arg Val Ala Met 
165 • 170 175 



Asp Gly Leu Gly Leu Gin Phe Pro Glu Gin Gly Phe Pro Ala Gly Pro 
180 185 190 



Pro Leu Leu Pro Pro His Met Gly Gly His Tyr Arg Asp Cys Gin Ser 
195 200 205 



Leu Gly Ala Pro Pro Leu Asp Gly Tyr Pro Leu Pro Thr Pro Asp Thr 
210 215 220 



Ser Pro Leu Asp Gly Val Asp Pro Asp Pro Ala Phe Phe Ala Ala Pro 
225 230 235 240 



Met Pro Gly Asp Cys Pro Ala Ala Gly Thr Tyr Ser Tyr Ala Gin Val 
245 250 255 



Ser Asp Tyr Ala Gly Pro Pro Glu Pro Pro Ala Gly Pro Met His Pro 
260 265 270 



Arg Leu Gly Pro Glu Pro Ala Gly Pro Ser lie Pro Gly Leu Leu Ala 
275 280 285 



Pro Pro Ser Ala Leu His Val Tyr Tyr Gly Ala Met Gly Ser Pro Gly 
290 295 300 



Ala Gly Gly Gly Arg Gly Phe Gin Met Gin Pro Gin His Gin His Gin 
305 310 315 320 



His Gin His Gin His His Pro Pro Gly Pro Gly Gin Pro Ser Pro Pro 
325 330 335 



Pro Glu Ala Leu Pro Cys Arg Asp Gly Thr Asp Pro Ser Gin Pro Ala 
340 345 350 



Glu Leu Leu Gly Glu Val Asp .Arg Thr Glu Phe Glu Gin Tyr Leu His 
355 360 365 



Phe Val Cys Lys Pro Glu Met Gly Leu Pro Tyr Gin Gly His Asp Ser 
370 375 380 



Gly Val Asn Leu Pro Asp Ser His Gly Ala lie Ser Ser Val Val Ser 
385 390, 395 400 



Asp Ala Ser Ser Ala Val Tyr Tyr Cys Asn Tyr Pro Asp Val 
405 410 
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<210> 19 

<211> 3702 

<212> DNA 

<213> NMJJ05682 GPR56 
<220> 

<221> CDS 

<222> (159) . . (2237) 

<223> 



<400> 19 

tggagggtct cgctctgtca cacaggctgg agtgcagtgg tgtgatcttg gctcatcgta 60 

acctccacct cccgggttca agtgattctc atgcctcagc ctcccgagta gctgggatta 120 

caggtggtga cttccaagag tgactccgtc ggaggaaa atg act ccc cag teg ctg 176 

Met Thr Pro Gin Ser Leu 
1 5 

ctg cag acg aca ctg ttc ctg ctg agt ctg etc ttc ctg gtc caa ggt 224 
Leu Gin Thr Thr Leu Phe Leu Leu Ser Leu Leu Phe Leu Val Gin Gly 
10 15 - 20 

gee cac ggc agg ggc cac agg gaa gac ttt cgc ttc tgc age cag egg 272 
Ala His Gly Arg Gly His Arg Glu Asp Phe Arg Phe Cys Ser Gin Arg 
25 30 35 

aac cag aca cac agg age age etc cac tac aaa ccc aca cca gac ctg 320 
Asn Gin Thr His Arg Ser Ser Leu His Tyr Lys Pro Thr Pro Asp Leu 
40 45 50 

cgc ate tec ate gag aac tec gaa gag gee etc aca gtc cat gee cct 368 
Arg lie Ser He Glu Asn Ser Glu Glu Ala Leu thr Val His Ala Pro 
55 60 65 70 

ttc cct gca gee cac cct get tec cga tec ttc cct gac ccc agg ggc 416 
Phe Pro Ala Ala His Pro Ala Ser Arg Ser Phe Pro Asp Pro Arg Gly 
75 80 85 

etc tac cac ttc tgc etc tac tgg aac cga cat get ggg aga tta cat 464 
Leu Tyr His Phe Cys Leu Tyr Trp Asn Arg His Ala Gly Arg Leu His 
90 95 100 

ctt etc tat ggc aag cgt gac ttc ttg ctg agt gac aaa gee tct age 512 
Leu Leu Tyr Gly Lys Arg Asp Phe Leu Leu Ser Asp Lys Ala Ser Ser 
105 HO 115 

etc etc tgc ttc cag cac cag gag gag age ctg get cag ggc ccc ccg 560 
Leu Leu Cys Phe Gin His Gin Glu Glu Ser Leu Ala Gin Gly Pro Pro 
120 125 130 - 

ctg tta gec act tct gtc acc tec tgg tgg age cct cag aac ate age 608 
Leu Leu Ala Thr Ser Val Thr Ser Trp Trp Ser Pro Gin Asn He Ser 
135 140 145 150 

ctg ccc agt gee gec age ttc acc ttc tec ttc cap agt cct ccc cac 656 
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Leu Pro Ser Ala Ala Ser Phe Thr Phe.Ser Phe His Ser Pro Pro His 
155 160 165 

acg gcc get" cac aat gec teg gtg gac atg tgc gag etc aaa agg gac 704 
Thr Ala Ala His Asn Ala Ser Val Asp Met Cys Glu Leu Lys Arg Asp 
170 175 180 

etc cag ctg etc age cag ttc ctg aag cat ccc cag aag gcc tea agg 752 
Leu Gin Leu Leu Ser Gin Phe Leu Lys His Pro Gin Lys Ala Ser Arg 
185 190 195 

agg ccc teg get gcc ccc gcc age cag cag ttg cag age ctg gag teg 800 
Arg Pro Ser Ala Ala Pro Ala Ser Gin Gin lieu Gin Ser Leu Glu Ser 
200 205 210 

aaa ctg ace tct gtg aga ttc atg ggg gac atg gtg tec ttc gag gag 848 
Lys Leu Thr Ser Val Arg Phe Met Gly Asp Met Val Ser Phe Glu Glu 
*215 220 225 230 

gac egg ate aac gcc acg gtg tgg aag etc cag ccc aca gcc ggc etc 896 
Asp Arg He Asn Ala Thr Val Trp Lys Leu Gin Pro Thr Ala Gly Leu 
235 240 245 

cag gac ctg cac ate cac tec egg cag gag gag gag cag age gag ate 944 
Gin Asp Leu His He His Ser Arg Gin Glu Glu Glu Gin Ser Glu He 
250 255 260 

atg gag tac teg gtg ctg ctg cct cga aca etc ttc cag agg acg aaa 992 
Met Glu Tyr Ser Val Leu Leu Pro Arg Thr Leu Phe Gin Arg Thr Lys 
265 270 . 275 

ggc egg age ggg gag get gag aag aga etc etc ctg gtg gac ttc age 1040 
Gly Arg Ser Gly Glu Ala Glu Lys Arg Leu Leu Leu Val Asp Phe Ser 
280 285 290 

age caa gcc ctg ttc cag gac aag aat tec age cac gtc ctg ggt gag 1088 
Ser Gin Ala Leu Phe Gin Asp Lys Asn Ser Ser His Val Leu Gly Glu 
2.95 300 305 . 310 

aag gtc ttg ggg att gtg gta cag aac acc aaa gta gcc aac etc acg 1136 
Lys Val Leu Gly He Val Val Gin Asn Thr Lys Val Ala Asn Leu Thr 
315 320-. 325 

gag ccc gtg gtg etc acc ttc cag cac cag eta cag ccg aag aat gtg 1184 
Glu Pro Val Val Leu Thr Phe Gin His Gin Leu Gin Pro Lys Asn Val 
330 335 340 

act ctg caa tgt gtg ttc tgg gtt gaa gac ccc aca ttg age age ccg 1232 
Thr Leu Gin Cys Val Phe Trp Val Glu Asp Pro Thr Leu Ser Ser Pro 
345 350 355 

ggg cat tgg age agt get ggg tgt gag acc gtc agg aga gaa acc caa 1280 
Gly His Trp Ser Ser Ala Gly Cys Glu Thr Val Arg Arg Glu Thr Gin 
360 365 370 

aca tec tgc ttc tgc aac cac ttg acc tac ttt gca gtg ctg atg gtc 1328 
Thr Ser Cys Phe Cys Asn His Leu Thr Tyr Phe Ala Val Leu Met Val 
375 380 385 390 

tec teg gtg gag gtg gac gcc gtg cac aag cac tac ctg age etc etc 1376 
Ser Ser Val Glu Val Asp Ala Val His Lys His Tyr Leu Ser Leu Leu 
395 400 405 

tec tac gtg ggc tgt gtc gtc tct gcc ctg gcc tgc ctt gtc acc att 1424 
Ser Tyr Val Gly Cys Val Val Ser Ala Leu Ala Cys Leu Val Thr He 
410 415 420 

gcc gcc tac etc tgc tec agg gtg ccc ctg ccg tgc agg agg aaa cct 1472 
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Ala Ala Tyr Leu Cys Ser Arg Val Pro Leu Pro Cys Arg Arg Lys Pro 

425 430 435 

egg gac tac acc ate aag gtg cac atg aac ctg ctg ctg gec gtc ttc 1520 

Arg Asp Tyr Thr lie Lys Val His Met Asn Leu Leu Leu Ala Val Phe 

440 445 450 

ctg ctg gac acg age ttc ctg etc age gag ccg gtg gee ctg aca ggc 1568 

Leu Leu Asp Thr Ser Phe Leu Leu Ser Glu Pro Val Ala Leu Thr Gly 
455 460 465 470 

tct gag get ggc tgc cga gee agt gee ate ttc ctg cac ttc tec ctg 1616 

Ser Glu Ala Gly Cys Arg Ala Ser Ala lie Phe Leu His Phe Ser Leu 
475 480 485 

etc acc tgc ctt tec tgg atg ggc etc gag ggg tac aac etc tac cga 1664 

Leu Thr Cys Leu Ser Trp Met Gly Leu Glu Gly Tyr Asn Leu Tyr Arg 
490 495 500 

etc gtg gtg gag gtc ttt ggc acc tat gtc cct ggc tac eta etc aag 1712 

Leu Val Val Glu Val Phe Gly Thr Tyr Val Pro Gly Tyr Leu Leu Lys 

505 510 515 

ctg age gee atg ggc tgg ggc ttc ccc ate ttt ctg gtg acg ctg gtg 1760 

Leu Ser Ala Met Gly Trp Gly Phe Pro lie Phe Leu Val Thr Leu Val 

520 525 530 

gee ctg gtg gat gtg gac aac tat ggc ccc ate ate ttg get gtg cat 1808 

Ala Leu Val Asp Val Asp Asn Tyr Gly Pro lie lie Leu Ala Val His 
535 540 545 550 

agg act cca gag ggc gtc ate tac cct tec atg tgc tgg ate egg gac 1856 

Arg Thr Pro Glu Gly Val He Tyr Pro Ser Met Cys Trp He Arg Asp 
555 560 565 

tec ctg gtc age tac ate acc aac ctg ggc etc ttc age ctg gtg ttt 1904 

Ser Leu Val Ser Tyr He Thr Asn Leu Gly Leu Phe Ser Leu Val Phe 
570 575 580 

ctg ttc aac atg gee atg eta gee acc atg gtg gtg cag ate ctg egg 1952 

Leu Phe Asn Met Ala Met Leu Ala Thr Met Val Val Gin He Leu Arg 

585 -590 595 

ctg cgc ccc cac acc caa aag tgg tea cat gtg ctg aca ctg ctg ggc 2000 

Leu Arg Pro His Thr Gin Lys Trp Ser His Val Leu Thr Leu Leu Gly 

600 605 610 

etc age ctg gtc ctt ggc ctg ccc tgg gee ttg ate ttc ttc tec ttt 2048 

Leu Ser Leu Val Leu Gly Leu Pro Trp Ala Leu He Phe Phe Ser Phe 
615 620 625 630 

get tct ggc acc ttc cag ctt gtc gtc etc tac ctt ttc age ate ate 2096 

Ala Ser Gly Thr Phe Gin Leu Val Val Leu Tyr Leu Phe Ser He He 
635 640 645 

acc tec ttc caa ggc ttc etc ate ttc ate tgg tac tgg tec atg egg 2144 

Thr Ser Phe Gin Gly Phe Leu He Phe He Trp Tyr Trp Ser Met Arg 
650 655 660 

ctg cag gee egg ggt ggc ccc tec cct ctg aag age aac tea gac age 2192 

Leu Gin Ala Arg Gly Gly Pro Ser Pro Leu Lys Ser Asn Ser Asp Ser 

665 670 675 

gee agg etc ccc ate age teg ggc age acc teg tec age cgc ate 2237 

Ala Arg Leu Pro He Ser Ser Gly Ser Thr Ser Ser Ser Arg lie 

680 685 690 

. taggcctcca gcccacctgc ccatgtgatg aagcagagat gcggcctcgt cgcacactgc 2297 
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ctgtggcccc cgagccaggc ccagccccag gccagtcagc cgcagacttt ggaaagccca 2357 

acgaccatgg agagatgggc cgttgccatg gtggacggac tcccgggctg ggcttttgaa 2417 

ttggccttgg ggactactcg gctctcactc agctcccacg ggactcagaa gtgcgccgcc 2477 

atgctgccta gggtactgtc cccacatctg tcccaaccca gctggaggcc tggtctctcc 2537 

ttacaaaccc tgggcccagc cctcattgct gggggccagg ccttggatct tgagggtctg 2597 

gcacatcctt aatcctgtgc ccctgcctgg gacagaaatg tggctccagt tgctctgtct 2657 

ctcgtggtca ccctgagggc actctgcatc ctctgtcatt ttaacctcag gtggcaccca 2717 

gggcgaatgg ggcccagggc agaccttcag ggccagagcc ctggcggagg agaggccctt 2777 

tgccaggagc acagcagcag ctcgcctacc tctgagccca ggccccctcc ctccctcagc 2837 

cccccagtcc tccctccatc ttccctgggg ttctcctcct ctcccagggc ctccttgctc 2897 

cttcgttcac agctgggggt ccccgattcc aatgctgttt tttggggagt ggtttccagg 2957 

agctgcctgg tgtctgctgt aaatgtttgt ctactgcaca agcctcggcc tgcccctgag 3017 

ccaggctcgg taccgatgcg tgggctgggc taggtccctc tgtccatctg ggcctttgta 3077 

tgagctgcat tgcccttgct caccctgacc aagcacacgc ctcagagggg ccctcagcct 3137 

ctcctgaagc cctcttgtgg caagaactgt ggaccatgcc agtcccgtct ggtttccatc 3197 

ccaccactcc aaggactgag actgacctcc tctggtgaca ctggcctaga gcctgacact 3257 

ctcctaagag gttctctcca agcccccaaa tagctccagg cgccctcggc cgcccatcat 3317 

ggttaattct gtccaacaaa cacacacggg tagattgctg gcctgttgta ggtggtaggg 3377 

acacagatga ccgacctggt cactcctcct gccaacattc agtctggtat gtgaggcgtg 3437 

cgtgaagcaa gaactcctgg agctacaggg acagggagcc atcattcctg cctgggaatc 34 97 

ctggaagact tcctgcagga gtcagcgttc aatcttgacc ttgaagatgg gaaggatgtt 3557 

ctttttacgt accaattctt ttgtcttttg atattaaaaa gaagtacatg ttcattgtag 3617 

agaatttgga aactgtagaa gagaatcaag aagaaaaata aaaatcagct gttgtaatcg 3677 
cctagcaaaa aaaaaaaaaa aaaaa 

<210> 20 
<211> 693 
<212> PRT 

<213> NM 005682 GPRS 6 



3702 



<400> 20 

Met Thr Pro Gin Ser Leu Leu Gin Thr Thr Leu Phe Leu Leu Ser Leu 
1 5 10 15 

Leu Phe Leu Val Gin Gly Ala His Gly Arg Gly, His Arg Glu Asp Phe 
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25 



30 



Arg Phe Cys Ser Gin Arg Asn Gin Thr His Arg Ser Ser Leu His Tyr 
35 40 45 



Lys Pro Thr Pro Asp Leu Arg lie Ser lie Glu Asn Ser Glu Glu Ala 
50 * 55 60 



Leu Thr Val His Ala Pro Phe Pro Ala Ala His Pro Ala Ser Arg Ser 
65 70 75 80 



Phe Pro Asp Pro Arg Gly Leu Tyr His Phe Cys Leu Tyr Trp Asn Arg 
85 90 95 



His Ala Gly Arg Leu His Leu Leu Tyr Gly Lys Arg Asp Phe Leu Leu 
100 105 110 



Ser Asp Lys Ala Ser Ser Leu Leu Cys Phe Gin His Gin Glu Glu Ser 
115 120 125 



Leu Ala Gin Gly Pro Pro Leu Leu Ala Thr Ser Val Thr Ser Trp Trp 
130 135 140 



Ser Pro Gin Asn lie Ser Leu Pro Ser. Ala Ala Ser Phe Thr Phe Ser 
145 150 155 160 



Phe His Ser Pro Pro His Thr Ala Ala His Asn Ala Ser Val Asp Met 
165 170 175 



Cys Glu Leu Lys Arg Asp Leu Gin Leu Leu Ser Gin Phe Leu Lys His. 
180 185 190 



Pro Gin Lys Ala Ser Arg Arg Pro Ser Ala Ala Pro Ala Ser Gin Gin 
195 200 205 



Leu Gin Ser Leu Glu Ser Lys Leu Thr Ser Val Arg Phe Met Gly Asp 
210 215 220 



Met Val Ser Phe Glu Glu Asp Arg He Asn Ala Thr Val Trp Lys Leu 
225 230 235 240 



Gin Pro Thr Ala Gly Leu Gin Asp Leu His He His Ser Arg Gin Glu 
245 250 255 



Glu Glu Gin Ser Glu He Met Glu Tyr Ser Val Leu Leu Pro Arg Thr 
260 265 270 



Leu Phe Gin Arg Thr Lys Gly Arg Ser Gly Glu Ala Glu Lys Arg Leu 
275 280 285 



Leu Leu Val Asp Phe Ser Ser Gin Ala Leu Phe Gin Asp Lys Asn Ser 
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290 295 300 

Ser His Val Leu Gly Glu Lys Val Leu Gly lie Val Val Gin Asn Thr 
305 310 315 320 

Lys Val Ala Asn Leu Thr Glu Pro Val Val Leu Thr Phe Gin His Gin 
325 330 335 

Leu Gin Pro Lys Asn Val Thr Leu Gin Cys Val Phe Trp Val Glu Asp 
340 345 350 

Pro Thr Leu Ser Ser Pro Gly His Trp Ser Ser Ala Gly Cys Glu Thr 
355 360 365 

Val Arg Arg Glu Thr Gin Thr Ser Cys Phe Cys Asn His Leu Thr Tyr 
370 375 380 

Phe Ala Val Leu Met Val Ser Ser Val Glu Val Asp Ala Val His Lys 
385 390 395 400 

His Tyr Leu Ser Leu Leu Ser Tyr Val Gly Cys Val Val Ser Ala Leu 
405 410 415 

Ala Cys Leu Val Thr lie Ala Ala Tyr, Leu Cys Ser Arg Val Pro Leu 
420 425 430 

Pro Cys Arg Arg Lys Pro Arg Asp Tyr Thr lie Lys Val His Met Asn 
435 440 445 

Leu Leu Leu Ala Val Phe Leu Leu Asp Thr Ser Phe Leu Leu Ser Glu 
450 455 .460 

Pro Val Ala Leu Thr Gly Ser Glu Ala Gly Cys Arg Ala Ser Ala He 
465 470 475 480 

Phe Leu His Phe Ser Leu Leu Thr Cys Leu Ser Trp Met Gly Leu Glu 
485 490 495 

Gly Tyr Asn Leu Tyr Arg Leu Val Val Glu Val Phe Gly Thr Tyr Val 
500 505 510 

Pro Gly Tyr Leu Leu Lys Leu Ser Ala Met Gly Trp Gly Phe Pro He 
515 520 525 



Phe Leu Val Thr Leu Val Ala Leu Val Asp Val Asp Asn Tyr Gly Pro 
530 535 540 



He He Leu Ala Val His Arg Thr Pro Glu Gly Val He Tyr Pro Ser 
545 550 555 560 
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Met Cys Trp "lie Arg Asp Ser Leu Val Ser Tyr He Thr Asn Leu Gly 
565 570 575 



Leu Phe Ser Leu Val Phe Leu Phe Asn Met Ala Met Leu Ala Thr Met 
580 585 590 



Val Val Gin He Leu Arg Leu Arg Pro His Thr Gin Lys Trp Ser His 
595 600 605 



Val Leu Thr Leu Leu Gly Leu Ser Leu Val Leu Gly Leu Pro Trp Ala 
610 615 620 



Leu He Phe Phe Ser Phe Ala Ser Gly Thr Phe Gin Leu Val Val Leu 
625 630 . 635 640 



Tyr Leu Phe Ser He He Thr Ser Phe Gin Gly Phe "Leu He Phe He 
645 650 655 



Trp Tyr Trp Ser Met Arg Leu Gin Ala Arg Gly Gly Pro Ser Pro Leu 
660 665 670 



Lys Ser Asn Ser Asp Ser Ala Arg Leu Pro He Ser Ser Gly Ser Thr 
675 680 685 



Ser Ser Ser Arg He 
690 



<210> 21 

<211> 1332 

<212> DNA 

<213> NM 001307 claudin 7, CLDN7 



<220> 

<221> CDS 

<222> (427) . . (1059) 

<223> 



<400> 21 
ctgctggctc 


acctccgagc 


cacctctgct gcgcaccgca gcctcggacc tacagcccag 


60 


gatactttgg 


gacttgccgg 


cgctcagaaa cgcgcccaga cggcccctcc accttttgtt 


120 


tgcctagggt 


cgccgagagc 


gcccggaggg aaccgcctgg ccttcgggga ccaccaattt 


180 


tgtctggaac 


caccctcccg 


gcgtatccta ctccctgtgc cgcgaggcca tcgcttcact 


240 


ggaggggtcg 


atttgtgtgt 


agtttggtga caagatttgc attcacctgg cccaaaccct 


300 


ttttgtctct 


ttgggtgacc 


ggaaaactcc acctcaagtt ttcttttgtg gggctgcccc 


360 
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ccaagtgtcg tttgttttac tgtagggtct cccgcccggc gcccccagtg ttttctgagg 420 

gcggaa atg gcc aat teg ggc ctg cag ttg ctg ggc ttc tec atg gee 
Met Ala Asn Ser Gly Leu Gin Leu Leu Gly Phe Ser Met Ala 
15 10 



cag atg age tec tat gcg ggt gac aac ate ate acg gcc cag gcc atg 
Gin Met Ser Ser Tyr Ala Gly Asp Asn He He Xhr Ala Gin Ala Met 
35 40 45 



age tgc aaa atg tac gac teg gtg etc gcc ctg tec gcg gcc ttg cag 
Ser Cys Lys Met Tyr Asp Ser Val Leu Ala Leu Ser Ala Ala Leu Gin 
65 70 75 

gcc act cga gcc eta atg gtg gtc tec ctg gtg ctg ggc ttc ctg gcc 
Ala Thr Arg Ala Leu Met Val Val Ser Leu Val Leu Gly Phe Leu Ala 
80 85 90 



gac aaa gtg aag aag gcc cgt ata gcc atg ggt gga ggc ata att ttc 
Asp Lys Val Lys Lys Ala Arg He Ala Met Gly Gly Gly He He Phe 
115 120 125 

ate gtg gca ggt ctt gcc ace ttg gta get tgc tec tgg tat ggc cat 
He Val Ala Gly Leu Ala Thr Leu Val Ala Cys Ser Trp Tyr Gly His 
130 135 140 

cag att gtc aca gac ttt tat aac cct ttg ate cct acc aac att aag 
Gin lie Val Thr Asp Phe Tyr Asn Pro Leu He Pro Thr Asn He Lys 
145 150 155 

tat gag ttt ggc cct gcc ate ttt att ggc tgg gca ggg tct gcc eta 
Tyr Glu Phe Gly Pro Ala He Phe He Gly Trp Ala Gly Ser Ala Leu 
160 165 170 



age aag get ggg tac cgt gca ccc cgc tct tac cct aag tec aac tct 
Ser Lys Ala Gly Tyr Arg Ala Pro Arg Ser Tyr Pro Lys Ser Asn Ser 
195 200 205 



468 



ctg ctg ggc tgg gtg ggt ctg gtg gcc tgc acc gcc ate ccg cag tgg 516 
Leu Leu Gly Trp Val Gly Leu Val Ala Cys Thr Ala He Pro Gin Trp 
15 20 25 30 



564 



tac aag ggg ctg tgg atg gac tgc gtc acg cag age acg ggg atg atg 612 
Tyr Lys Gly Leu Trp Met Asp Cys Val Thr Gin Ser Thr Gly Met Met 
50 55 60 



660 



708 



atg ttt gtg gcc acg atg ggc atg aag tgc acg cgc tgt ggg gga gac 756 
Met Phe Val Ala Thr Met Gly Met Lys Cys Thr Arg Cys Gly Gly Asp 
95 100 105 HO 



804 



852 



900 



948 



gtc ate ctg gga ggt gca ctg etc tec tgt tec tgt cct ggg aat gag 996 
Val He Leu Gly Gly Ala Leu Leu Ser Cys Ser Cys Pro Gly Asn Glu 
175 180 185 190 



1044 



tec aag gag tat gtg tgacctggga tctccttgcc ccagcctgac aggctatggg 1099 
Ser Lys Glu Tyr Val 
210 

agtgtctaga tgcctgaaag ggcctggggc tgagctcagc ctgtgggcag ggtgccggac 1159 

aaaggcctcc tggtcactct gtccctgcac tccatgtata gtcctcttgg gttgggggtg 1219 

ggggggtgcc gttggtggga gagacaaaaa gagggagagt gtgctttttg tacagtaata 1279 

aaaaataagt attgggaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1332 
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<210> 


22 


<211> 


211 


<212> 


PRT 


<213> 


NMJ 


<400> 


22 



Met Ala Asn Ser Gly Leu Gin Leu Leu Gly Phe Ser Met Ala Leu Leu 
15 10 15 



Gly Trp Val Gly Leu Val Ala Cys Thr Ala lie Pro Gin Trp Gin Met 
20 25 - 30 



Ser Ser Tyr Ala Gly Asp Asn lie lie Thr Ala Gin Ala Met Tyr Lys 
35 40 45 



Gly Leu Trp Met Asp Cys Val Thr Gin Ser Thr Gly Met Met Ser Cys 
50 55 60 



Lys Met Tyr Asp Ser Val Leu Ala Leu Ser Ala Ala Leu Gin Ala Thr 
65 * 70 75 80 



Arg Ala Leu Met Val Val Ser Leu Val Leu Gly Phe Leu Ala Met Phe 
85 - 90 95 



Val Ala Thr Met Gly Met Lys Cys Thr Arg Cys Gly Gly Asp Asp Lys 
100 105 HO 



Val Lys Lys Ala Arg He Ala Met Gly Gly Gly He He Phe He Val 
115 120 125 



Ala Gly Leu Ala Thr Leu Val Ala Cys Ser Trp Tyr Gly His Gin He 
130 135 140 

Val Thr Asp Phe Tyr Asn Pro Leu He Pro Thr Asn He Lys Tyr Glu 
145 150 155 160 



Phe Gly Pro Ala He Phe He Gly Trp Ala Gly Ser Ala Leu Val He 
165 170 175 



Leu Gly Gly Ala Leu Leu Ser Cys Ser Cys Pro Gly Asn Glu Ser Lys 
180 185 190 



Ala Gly Tyr Arg Ala Pro Arg Ser Tyr Pro Lys Ser Asn Ser Ser Lys 
195 200 205 



Glu Tyr Val 
210 
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<210> 23 

<211> 888 

<212> DNA 

<213> NM_014736 KIAA0101 gene product 
<220> 

<221> CDS 

<222> (75). .(407) 

<223> 



<400> 23 

ggggccttgt ccagtgaaac accctcggct gggaagtcag ttcgttctct cctctcctct 60 

cttcttgttt gaac atg gtg egg act aaa gca gac agt gtt cca ggc act 
Met Val Arg Thr Lys Ala Asp Ser Val Pro Gly Thr 
1 5 .10 



tac aga aaa gtg gtg get get cga gcc'ccc aga aag gtg ctt ggt tct 
Tyr Arg Lys Val Val Ala Ala Arg Ala Pro Arg Lys Val Leu Gly Ser 
15 20 25 

tec ace tct gec act aat teg aca tea gtt tea teg agg aaa get gaa 
Ser Thr Ser Ala Thr Asn Ser Thr Ser Val Ser Ser Arg Lys Ala Glu 
30 35 40 

aat aaa tat gca gga ggg aac ccc gtt tgc gtg cgc cca act ccc aag 
Asn Lys Tyr Ala Gly Gly Asn Pro Val Cys Val Arg Pro Thr Pro Lys 
45 50 55 60 

tgg caa aaa gga att gga gaa ttc ttt agg ttg tec cct aaa gat tct 
Trp Gin Lys Gly He Gly Glu Phe Phe Arg Leu Ser Pro Lys Asp Ser 
65 70 75 

gaa aaa gag aat cag att cct gaa gag gca gga age agt ggc tta gga 
Glu Lys Glu Asn Gin He Pro Glu Glu Ala Gly Ser Ser Gly Leu Gly 
80 85 90 

aaa gca aag aga aaa gca tgt cct ttg caa cct gat cac aca aat gat 
Lys Ala Lys Arg Lys Ala Cys Pro Leu Gin Pro Asp His Thr Asn Asp 
95 100 105 



110 



158 



206 



254 



302 



350 



398 



gaa aaa gaa tagaactttc tcattcatct ttgaataacg tctccttgtt 


447 


Glu Lys Glu 












110 














taccctggta 


ttctagaatg 


taaatttaca 


taaatgtgtt 


tgttccaatt 


agctttgttg 


507 


aacaggcatt 


taattaaaaa 


atttaggttt 


aaatttagat 


gttcaaaagt 


agttgtgaaa 


567 


tttgagaatt 


tgtaagacta 


attatggtaa 


cttagcttag 


tattcaatat 


aatgcattgt 


627 


ttggtttctt 


ttaccaaatt 


aagtgtctag 


ttcttgctaa 


aatcaagtca 


ttgcattgtg 


687 


ttctaattac 


aagtatgttg 


tatttgagat 


ttgettagat 


tgttgtactg 


ctgccatttt 


747 


tattggtgtt 


tgattattgg 


aatggtgcca 


tattgtcact 


ccttctactt 


gctttaaaaa 


807 


gcagagttag 


atttttgeae 


attaaaaaaa 


ttcagtatta 


attaaacact 


aaaaaaaaaa 


867 
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888 



aaaaaaaaaa aaaaaaaaaa a 

<210> 24 
<211> 111 
<212> PRT 

<213> NM_014736 KIAA0101 gene product . 
<400> 24 

Met Val Arg Thr Lys Ala Asp Ser Val Pro Gly Thr Tyr Arg Lys Val 
15 10 15 

Val Ala Ala Arg Ala Pro Arg Lys Val Leu Gly Ser Ser Thr Ser Ala 
20 25 30 

Thr Asn Ser Thr Ser Val Ser Ser Arg Lys Ala Glu Asn Lys Tyr Ala 
35 40 45 

Gly Gly Asn Pro Val Cys Val Arg Pro Thr Pro Lys Trp Gin Lys Gly 
50 55 60 . 

lie Gly Glu Phe Phe Arg Leu Ser Pro Lys Asp Ser Glu Lys Glu Asn 
65 70 75 80 

Gin lie Pro Glu Glu Ala Gly Ser Ser Gly Leu Gly Lys Ala Lys Arg 
85 90 , 95 

Lvs Ala Cys Pro Leu Gin Pro Asp His Thr Asn Asp Glu Lys Glu 
100 105 HO 



<210> 25 

<211> 598 

<212> DNA 

<213> NM_003064 secretory leukocyte protease inhibitor, SLPI 

<220> 

<221> CDS 

<222> (23).. (418) 

<223> 



cagagtcact cctgccttca cc atg aag tec age ggc etc ttc ccc ttc ctg 52 
* y Met Lys Ser Ser Gly Leu Phe Pro Phe Leu 

1 5 10 
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gtg ctg ctt gcc ctg gga act ctg gca cct tgg get gtg gaa ggc tct 
Val Leu Leu Ala Leu Gly Thr Leu Ala Pro Trp Ala Val Glu Gly Ser 
15 .20 25 

gga aag tec ttc aaa get gga gtc tgt cct cct aag aaa tct gcc cag 
Gly Lys Ser Phe Lys Ala Gly Val Cys Pro Pro Lys Lys Ser Ala Gin 
30 35 .40 

tgc ctt aga tac aag aaa cct gag tgc cag agt gac tgg cag tgt cca 
Cys Leu Arg Tyr Lys Lys Pro Glu Cys Gin Ser Asp Trp Gin Cys Pro 
45 50 55 

ggg aag aag aga tgt tgt cct gac act tgt ggc ate aaa tgc ctg gat 
Gly Lys Lys Arg Cys Cys Pro Asp Thr Cys Gly He Lys Cys Leu Asp 
60 65 70 

cct gtt gac acc cca aac cca aca agg agg aag cct ggg aag tgc cca 
Pro Val Asp Thr Pro Asn Pro Thr Arg Arg Lys Pro Gly Lys Cys Pro 
75 80 . 85 90 

gtg act tat ggc caa tgt ttg atg ctt aac ccc ccc aat ttc tgt gag 
Val Thr Tyr Gly Gin Cys Leu Met Leu Asn Pro Pro Asn Phe Cys Glu 
95 100 . . 105 

atg gat ggc cag tgc aag cgt gac ttg aag tgt tgc atg ggc atg tgt 
Met Asp Gly Gin Cys Lys Arg Asp Leu Lys Cys Cys Met Gly Met Cys 
110 H5 120 

ggg aaa tec tgc gtt tec cct gtg aaa get tgattcctgc catatggagg 
Gly Lys Ser Cys Val Ser Pro Val Lys Ala 
125 130 

aggctctgga gtcctgctct gtgtggtcca ggtcctttcc accctgagac ttggctccac 
cactgatatc ctcctttggg gaaaggcttg gcacacagca ggctttcaag aagtgccagt 
• tgatcaatga ataaataaac gagectattt ctctttgcac 

<210> 26 

<2U> 132 

<212> PRT 

<213> NM_003064 secretory leukocyte protease inhibitor, SLPI 



<400> 26 

Met Lys Ser Ser Gly Leu Phe Pro Phe Leu Val Leu Leu Ala Leu Gly 
.! 5 10 15 



100 

148 

196 

244 

292 

340 

388 

438 

498 
558 
598 



Thr Leu Ala Pro Trp Ala Val Glu Gly Ser Gly Lys Ser Phe Lys Ala 
20 25 30 

Gly Val Cys Pro Pro Lys Lys Ser Ala Gin Cys Leu Arg Tyr Lys Lys 
35 40 45 

Pro Glu Cys Gin Ser Asp Trp Gin Cys Pro Gly Lys Lys Arg Cys Cys 
50 55 60 
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Pro Asp Thr Cys Gly He Lys Cys Leu Asp Pro Val Asp Thr Pro Asn 
65 70 75 80 

Pro Thr Arg Arg Lys Pro Gly Lys Cys Pro Val Thr Tyr Gly Gin Cys 
85 90 95 



Leu Met Leu Asn Pro Pro Asn Phe Cys Glu Met Asp Gly Gin Cys Lys 
100 105 . HO 

Arg Asp Leu Lys Cys Cys Met Gly Met Cys Gly Lys Ser Cys Val Ser 
115 120 125 



Pro Val Lys Ala 
130 



<210> 27 

<211> 3970 

<212> DNA 

<213> NM_013994 DDR1 

<220> 

<221> CDS 

<222> (337) . . (3093) 

<223> 



<400> 27 

ggcttaggaa gtattaactg atctctgccc tagttctcat gtgttaaata tggatagtaa 



60 
120 



tagtatctac cttatgaagt gactgtgaag ataaaattat ggattctgtt taagggttta 

ggccagtgtc tggcacaggg gaagcattct aaaaatatag ctgatgctgt taaacaatga 180 

ctgttgttgt tgttttactg ttattatccc caaagcggcc cattctgtct gttgctgtca 240 

gctatgactc agtcccctga ttaacttacg caccacccat tttatcccct gcagagatgc 300 

tgcccccacc cccttaggcc cgagggatca ggagct atg gga cca gag gcc ctg 354 

Met Gly Pro Glu Ala Leu 
1 5 

tea tct tta ctg ctg ctg etc ttg gtg gca agt gga gat get gac atg 
Ser Ser Leu Leu Leu Leu Leu Leu Val Ala Ser Gly Asp Ala Asp Met 
10 15 20 

aag gga cat ttt gat cct gcc aag tgc cgc tat gcc ctg ggc atg cag 
Lys Gly His Phe Asp Pro Ala Lys Cys Arg Tyr Ala Leu Gly Met Gin 
25 30 35 

gac egg ace ate cca gac agt gac ate tct get tec age tec tgg tea 
Asp Arg Thr He Pro Asp Ser Asp He Ser Ala Ser Ser Ser Trp Ser 
40 45 50 

gat tec act gcc gcc cgc cac age agg ttg gag age agt gac ggg gat 



402 



450 



498 



546 
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Asp Ser Thr Ala Ala Arg His Ser Arg Leu Glu Ser Ser Asp Gly Asp 
55 60 65 70 



642 



690 



738 



ggg gcc tgg tgc ccc gca ggg teg gtg ttt ccc aag gag gag gag tac 594 
Gly Ala Trp Cys Pro Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr 
75 80 85 

ttg cag gtg gat eta caa cga ctg cac ctg gtg get ctg gtg ggc ace 
Leu Gin Val Asp Leu Gin Arg Leu His Leu Val Ala Leu Val Gly Thr 
90 95 • 100 

cag gga egg cat gcc ggg ggc ctg ggc aag gag ttc tec egg age tac 
Gin Gly Arg His Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr 
105 110 115 

egg ctg cgt tac tec egg gat ggt cgc cgc tgg atg ggc tgg aag gac 
Arg Leu Arg Tyr iSer Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp 
120 125 130 

cgc tgg ggt cag gag gtg ate tea ggc aat gag gac cct gag gga gtg 786 
Arg Trp Gly Gin Glu Val He Ser Gly Asn Glu Asp Pro Glu Gly Val 
135 140 145 150 

gtg ctg aag gac ctt ggg ccc ccc atg gtt gcc cga ctg gtt cgc ttc 834 
Val Leu Lys Asp Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe 
155 160 . 165 

tac ccc egg get gac egg gtc atg age gtc tgt ctg egg gta gag etc 882 
Tyr Pro Arg Ala Asp Arg Val Met Ser Val Cys Leu Arg Val Glu Leu 
170 175 180 

tat ggc tgc etc tgg agg gat gga etc ctg. tct tac ace gcc cct gtg 
Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val 
185 190 195 

ggg cag aca atg tat tta tct gag gcc gtg tac etc aac gac tec acc 
Gly Gin Thr Met Tyr Leu Ser Glu Ala Val Tyr Leu Asn Asp Ser Thr 
200 205 210 

tat gac gga cat acc gtg ggc gga ctg cag tat ggg ggt ctg ggc cag 
Tyr Asp Gly His Thr Val Gly Gly Leu Gin Tyr Gly Gly Leu Gly Gin 
215. 220 225 230 

ctg gca gat ggt gtg gtg ggg ctg gat gac ttt agg aag agt cag gag 1074 
Leu Ala Asp Gly Val Val Gly Leu Asp Asp Phe Arg Lys Ser Gin Glu 
235 240 245 

ctg egg gtc tgg cca ggc tat gac tat gtg' gga tgg age aac cac age 1122 
Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val Gly Trp Ser Asn His Ser 
250 255 260 

ttc tec agt ggc tat gtg gag atg gag ttt gag ttt gac egg ctg agg 1170 
Phe Ser Ser Gly Tyr Val Glu Met Glu Phe Glu Phe Asp Arg Leu Arg 
265 270 275 

gcc ttc cag get atg cag gtc cac tgt aac aac atg cac acg ctg gga 1218 
Ala Phe Gin Ala Met Gin Val His Cys Asn Asn Met His Thr Leu Gly 
280 285 290 

gcc cgt ctg cct ggc ggg gtg gaa tgt cgc ttc egg cgt ggc cct gcc 1266 
Ala Arg Leu Pro Gly Gly Val Glu Cys Arg Phe Arg Arg Gly Pro Ala 
295 300 305 310 

atg gcc tgg gag ggg gag ccc atg cgc cac aac eta ggg ggc aac ctg 1314 
Met Ala Trp Glu Gly Glu Pro Met Arg His Asn Leu Gly Gly Asn Leu 
315 320 325 



930 



978 



1026 
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ggg gac ccc aga gcc egg get gtc tea gtg ccc ctt ggc ggc cgt gtg 
Gly Asp Pro Arg Ala Arg Ala Val Ser Val Pro Leu Gly Gly Arg Val 
330 335 340 

get cgc ttt ctg cag tgc cgc ttc etc ttt gcg ggg ccc tgg tta etc 
Ala Arg Phe Leu Gin Cys Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu 
345 350 355 

ttc age gaa ate tec ttc ate tct gat gtg gtg aac aat tec tct ccg 
Phe Ser Glu He Ser Phe He Ser Asp Val Val Asn Asn Ser Ser Pro 
360 365 . 370 

gca ctg gga ggc ace ttc ccg cca gcc ccc tgg tgg ccg cct ggc cca 
Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro 
375 380 385 390 

cct ccc ace aac ttc age age ttg gag ctg gag ccc aga ggc cag cag 
Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin Gin 
395 400 405 

ccc gtg gcc aag gcc gag ggg age ccg acc gcc ate etc ate ggc tgc 
Pro Val Ala Lys Ala Glu Gly Ser Pro Thr Ala He Leu He Gly Cys 
410 ' 415 420 

ctg gtg gcc ate ate ctg etc ctg ctg etc ate att gcc etc atg etc 
Leu Val Ala He He Leu Leu Leu Leu Leu He He Ala Leu Met Leu 
425 430 435 

tgg egg ctg cac tgg cgc agg etc etc age aag get gaa egg agg gtg 
Trp Arg Leu His Trp Arg Arg Leu Leu Ser Lys Ala Glu Arg Arg Val 
440 445 450 



ttg gaa gag gag ctg acg gtt cac etc tct gtc cct ggg gac act ate 
Leu Glu Glu Glu Leu Thr Val His Leu Ser Val Pro Gly Asp Thr He 
455 460 465 470 

etc ate aac aac cgc cca ggt cct aga gag cca ccc ccg tac cag gag 
Leu He Asn Asn Arg Pro Gly Pro Arg:Glu .Pro Pro Pro Tyr Gin Glu 
475 480 485 

ccc' egg cct cgt ggg aat ccg ccc cac tec get ccc tgt gtc ccc aat 
Pro Arg Pro Arg Gly Asn Pro Pro His Ser Ala Pro Cys Val Pro Asn 
490 495 500 

ggc tct gcg ttg ctg etc tec aat cca gcc tac cgc etc ctt ctg gcc 
Gly Ser Ala Leu Leu Leu Ser Asn Pro Ala Tyr Arg Leu Leu Leu Ala 
505 510 515 

act tac gcc cgt ccc cct cga ggc ccg ggc ccc ccc aca ccc gcc tgg 
Thr Tyr Ala Arg Pro Pro Arg Gly Pro Gly Pro Pro Thr Pro Ala Trp 
520 " 525 530 

gcc aaa ccc acc aac acc cag gcc tac agt ggg gac tat atg gag cct 
Ala Lys Pro Thr Asn Thr Gin Ala Tyr Ser Gly Asp Tyr Met Glu Pro 
535 540 545 550 

gag aag cca ggc gcc ccg ctt ctg ccc cca cct ccc cag aac age gtc 
Glu Lys Pro Gly Ala Pro Leu Leu Pro Pro Pro Pro Gin Asn Ser Val 
555 560 565 

ccc cat tat gcc gag get gac att gtt acc ctg cag ggc gtc acc ggg 
Pro His Tyr Ala Glu Ala Asp He Val Thr Leu Gin Gly Val Thr Gly 
570 575 580 



1362 
1410 
1458 
1506 
1554 
1602 
1650 
1698 

1746 
1794 
1842 
1890 
1938 
1986 
2034 
2082 
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ggc aac acc tat get gtg cct gca ctg ccc cca ggg gca gtc ggg gat 2130 
Gly Asn Thr Tyr Ala Val Pro Ala Leu Pro Pro Gly Ala Vai Gly Asp 
585 590 595 

ggg ccc ccc aga gtg gat ttc cct cga tct cga etc cgc ttc aag gag 2178 
Gly Pro Pro Arg Val Asp Phe Pro Arg Ser Arg Leu Arg Phe Lys Glu 
600 605 610 

aag ctt ggc gag ggc cag ttt ggg gag gtg cac ctg tgt gag gtc gac 2226 
Lys Leu Gly Glu Gly Gin Phe Gly Glu Val His Leu Cys Glu Val Asp 
615 "* 620 625 630 

age cct caa gat ctg gtt agt ctt gat ttc ccc ctt aat gtg cgt aag 2274 
Ser Pro Gin Asp Leu Val Ser Leu Asp Phe Pro. Leu Asn Val Arg Lys 
635 640 645 

gga cac cct ttg ctg gta get gtc aagatc.tta egg cca gat gee acc 2322 
Gly His Pro Leu Leu Val Ala Val Lys lie Leu Arg Pro Asp Ala Thr 
650 655 660 

aag aat gee age ttc tec ttg ttc tec agg aat gat ttc ctg aaa gag 2370 
Lys Asn Ala Ser Phe Ser Leu Phe Ser Arg Asn Asp Phe Leu Lys Glu 
665 670 675 

gtg aag ate atg teg agg etc aag gac cca aac ate att egg ctg ctg 2418 
Val Lys He Met Ser Arg Leu Lys Asp Pro Asn He He Arg Leu Leu 
680 685 690 ' 

ggc gtg tgt gtg cag gac gac ccc etc tgc atg att act gac tac atg 2466 
Gly Val Cys Val Gin Asp Asp Pro Leu Cys Met He Thr Asp Tyr Met 
695 700 705 710 

gag aac ggc gac etc aac cag ttc etc agt gee cac cag ctg gag gac 2514 
Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser Ala His Gin Leu Glu Asp 
715 720 725 

aag gca gee gag ggg gee cct ggg gac ggg cag get gcg cag ggg ccc 2562 
Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly Gin Ala Ala Gin Gly Pro 
730 735 740 

acc ate age tac cca atg ctg ctg cat gtg gca gee cag ate gee tec 2610 
Thr He Ser Tyr Pro Met Leu Leu His Val Ala Ala Gin He Ala Ser 
745 750 755 

ggc atg cgc tat ctg gee aca etc aac ttt gta cat egg gac ctg gee 2658 
Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe Val His Arg Asp Leu Ala 
760 765 770 

acg egg aac tgc eta gtt ggg gaa aat ttc acc ate aaa ate gca gac 
Thr Arg Asn Cys Leu Val Gly Glu Asn Phe Thr He Lys He Ala Asp 
775 780 785 790 

ttt ggc atg age egg aac etc tat get ggg gac tat tac cgt gtg cag 2754 
Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly Asp Tyr Tyr Arg Val Gin 
795 800 805 

ggc egg gca gtg ctg ccc ate cgc tgg atg gee tgg gag tgc ate etc 2802 
Gly Arg Ala Val Leu Pro He Arg Trp Met Ala Trp Glu Cys He Leu 
810 815 820 

atg ggg aag ttc acg act gcg agt gac gtg tgg gee ttt ggt gtg acc 
Met Gly Lys Phe Thr Thr Ala Ser Asp Val Trp Ala Phe Gly Val Thr 
825 830 835 



2706 



2850 



ctg tgg gag gtg ctg atg etc tgt agg gee cag ccc ttt ggg tea get 
Leu Trp Glu Val Leu Met Leu Cys Arg Ala Gin Pro Phe Gly Ser Ala 



2898 
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840 



845 850 



cac cga cga gca ggt cat cga gaa cgc ggg gga gtt ctt ccg gga cca 
His Arg Arg Ala Gly His Arg Glu Arg Gly Gly Val Leu Pro Gly Pro 
855 860 865 870 

ggg ccg gca gtg tac ctg tec egg- ccg cct gec tgc ccg cag ggc eta 
Gly Pro Ala Val Tyr Leu Ser Arg Pro Pro Ala Cys Pro Gin Gly Leu 
875 880 885 

tat gag ctg atg ctt egg tgc tgg age egg gag tct gag cag cga cca 
Tvr Glu Leu Met Leu Arg Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro 
890 895 900 

ccc ttt tec cag ctg cat egg ttc ctg gca gag gat gca etc aac acg 
Pro Phe Ser Gin Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn Thr 
905 910 915 

gtg tgaatcacac atccagctgc ccctccctca gggagegate caggggaagc 
Val 



2946 



2994 



3042 



3090 



3143 



3203 
3263 
3323 
3383 
3443 
3503 
3563 
3623 
3683 



cagtgacact aaaacaagag gacacaatgg cacctctgcc cttcccctcc cgacagccca 
tcacctctaa tagaggcagt gagactgeag gtgggctggg cccacccagg gagctgatgc 
cccttctccc cttcctggac acactctcat gtccccttcc tgttcttcct tcctagaagc 
ccccctgtcg cccacccagc tggtcctgtg gatgggatcc tctccaccct cctctagcca 
tcccttgggg aagggtgggg agaaatatag gatagacact ggacatggcc cattggagca 
cctgggcccc actggacaac actgattcct ggagaggtgg ctgcgccccc agcttctctc 
tccctgtcac acactggacc ccactggctg agaatctggg ggtgaggagg acaagaagga 
gaggaaaatg tttccttgtg cctgctcctg tacttgtcct cagcttgggc ttcttcctcc 
tccatcacct gaaacactgg acctgggggt agccccgccc cagccctcag tcacccccac 
ttcccacttg cagtcttgta gctagaactt ctctaagcct ataegtttet gtggagtaaa 3743 
tattgggatt ggggggaaag agggagcaac ggcccatagc cttggggttg gacatctcta 3803 
gtgtagctgc cacattgatt tttctataat cacttggggt ttgtacattt ttggggggag 3863 
agacacagat ttttacacta atatatggac ctagcttgag gcaattttaa tcccctgcac 3923 
taggcaggta ataataaagg ttgagttttc cacaaaaaaa aaaaaaa 

<210> 28 

<211> 919 

<212>. PRT 

<213> NM_013994 DDR1 

<400> 28 

Met Glv Pro Glu Ala Leu Ser Ser Leu Leu Leu Leu Leu Leu Val Ala 
1 5 1° 15 



3970 



Ser. Gly Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg 
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20 



25 



30 



Tyr Ala Leu Gly Met Gin Asp Arg. Thr He Pro Asp Ser Asp He Ser 
35 40 45 



Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arg Leu 
50 55 60 



Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser Val Phe 
65 70 75 80 



Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp Leu Gin Arg Leu His Leu 
85 90 95 



Val Ala Leu Val Gly Thr Gin Gly Arg His Ala Gly Gly Leu Gly Lys 
100 105 HO 



Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser Arg Asp Gly Arg Arg 
115 120 125 



Trp Met Gly Trp Lys Asp Arg Trp Gly Gin Glu Val He Ser Gly Asn 
130 135 140 



Glu Asp Pro Glu Gly Val Val Leu Lys Asp Leu Gly Pro Pro Met Val 
145 " 150 155 160 



Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala Asp Arg Val Met Ser Val 
165 170 175 



Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu 
180 185 190 



Ser Tyr Thr Ala Pro Val Gly Gin Thr Met Tyr Leu Ser Glu Ala Val 
195 200 205 



Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His Thr Val Gly. Gly Leu Gin 
210 215 220 



Tyr Gly Gly Leu Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp 
225 230 235 240 



Phe Arg Lys Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val 
245 250 255 



Gly Trp Ser Asn His Ser Phe Ser Ser Gly Tyr Val .Glu Met Glu - Phe 
. 260 265 270 



Glu Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val His Cys Asn 
275 280 285 



Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly Val Glu Cys Arg 
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290 



295 



300 . 



Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met Arg His 



305 



310 



315 



Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg Ala Arg Ala Val Ser Val 

^30 335 



325 



Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gin Cys Arg Phe Leu Phe 
340 345. 350 

Ala Gly Pro Trp Leu Leu Phe Ser Glu He Ser Phe He Ser Asp Val 
355 360 365 

Val Asn Asn Ser Ser Pro Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro 
370 375 380 



Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu 
385 390 395 400 



Glu Pro Arg Gly Gin Gin Pro Val Ala Lys Ala Glu Gly. Ser Pro Thr 

'410 415 



405 



Ala He Leu He Gly Cys Leu Val Ala He He Leu Leu Leu Leu Leu 
420 425 430 

He He Ala Leu Met Leu Trp Arg Leu His Trp Arg Arg Leu Leu Ser 
435 440 445 

Lys Ala Glu Arg Arg Val Leu Glu Glu Glu Leu Thr Val His Leu Ser 
450 455 460 



Val Pro Gly Asp Thr He Leu He Asn Asn Arg Pro Gly Pro Arg Glu 



465 



470 



475 



Pro Pro Pro Tyr Gin Glu Pro Arg Pro Arg Gly Asn Pro Pro His Ser 
485 490 495 

Ala Pro Cys Val Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn Pro Ala 
500 505 510 

Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg Gly Pro Gly 
515 520 525 

Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr Gin Ala Tyr Ser 
530 535 540 



Gly Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala Pro Leu Leu Pro Pro 
545 550 555 560 
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Pro Pro Gin Asn Ser Val Pro His Tyr Ala Glu Ala Asp He Val Thr 
565 570 575 



Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr Ala Val Pro Ala Leu Pro 
580 585 590 



Pro Gly Ala Val Gly Asp Gly Pro Pro Arg Val Asp Phe Pro Arg Ser 
595 600 605 

Arg Leu Arg Phe Lys Glu Lys Leu Gly Glu Gly Gin Phe Gly Glu Val 
610 615 620 



His Leu Cys Glu Val Asp Ser Pro Gin Asp Leu Val Ser Leu Asp Phe 
625 . 630 635 640 



Pro Leu Asn Val Arg Lys Gly His Pro Leu Leu Val Ala Val Lys He 
645 650 655 



Leu Arg Pro Asp Ala Thr Lys Asn Ala Ser Phe Ser Leu Phe Ser Arg 
660 665 670 



Asn Asp Phe Leu Lys Glu Val Lys He Met Ser Arg Leu Lys Asp Pro 
675 680 685 



Asn He He Arg Leu Leu Gly Val Cys Val Gin Asp Asp Pro Leu Cys 
690 695 700 



Met He Thr Asp Tyr Met Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser 
705 710 715 720 

Ala His Gin Leu Glu Asp Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly 
725 730 735 

Gin Ala Ala Gin Gly Pro Thr He Ser Tyr Pro Met Leu Leu His Val 
740 745 750 

Ala Ala Gin He Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe 
755 760 765 

Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe 
770 775 780 

Thr He Lys He Ala A3p Phe Gly Met Ser Arg Asn Leu Tyr Ala Gly 
785 790 795 800 



Asp Tyr Tyr Arg Val Gin Gly Arg Ala Val Leu Pro He Arg Trp Met 
805 810 815 



Ala Trp Glu Cys lie Leu Met Gly Lys Phe Thr Thr Ala Ser Asp Val 
820 825 830 
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Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys Arg Ala 
835 840 845 



Gin Pro Phe Gly Ser Ala His Arg Arg Ala Gly His Arg Glu Arg Gly 
850 855 860 



Gly Val Leu Pro Gly Pro Gly Pro Ala Val Tyr Leu Ser Arg Pro Pro 
865 870. 875 880 



Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met Leu Arg Cys Trp Ser Arg 
885 890 895 



Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin Leu His Arg Phe Leu Ala 
900 905 910 



Glu Asp Ala Leu Asn Thr Val 
915 

<210> 29 

<211> 5698 

<212> DNA 

<213> NM 001067 TOP2A 



<220> 

<221> CDS 

<222> (127) . . (4719) 

<223> 



<400> 29 

aggttcaagt ggagctctcc taaccgacgc gcgtctgtgg agaagcggct tggtcggggg 60 

tggtctcgtg gggtcctgcc tgtttagtcg ctttcagggt tcttgagccc cttcacgacc 120 

gtcacc atg gaa gtg tea cca ttg cag cct gta aat gaa aat atg caa 168 
Met Glu Val Ser Pro Leu Gin Pro Val Asn Glu Asn Met Gin 
15 10 



gtc aac aaa ata aag aaa aat gaa gat get aag aaa aga ctg tct gtt 
Val Asn Lys lie Lys Lys Asn Glu Asp Ala Lys Lys Arg Leu Ser Val 
15 ^ 20 25 30 

gaa aga ate tat caa aag aaa aca caa ttg gaa cat att ttg etc cgc 
Glu Arg lie Tyr Gin Lys Lys Thr Gin Leu Glu His lie Leu Leu Arg 
35 40 45 



216 



264 



cca gac acc tac att ggt tct gtg gaa tta gtg ace cag caa atg tgg 
Pro Asp Thr Tyr lie Gly Ser Val Glu Leu Val Thr Gin Gin Met Trp 
50 55 60 



312 



gtt tac gat gaa gat gtt ggc att aac tat agg gaa gtc act ttt gtt 



360 
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408 



456 



504 



Val Tyr Asp Glu Asp Val Gly He' Asn Tyr Arg Glu Val Thr Phe Val 
65 " 70 ■ 75 

cct ggt ttg tac aaa ate ttt gat gag att eta gtt aat get gcg gac 
Pro Gly Leu Tyr Lys He Phe Asp Glu He Leu Val Asn Ala Ala Asp 
80 85 . 90 

aac aaa caa agg gac cca aaa atg tct tgt att aga gtc aca att gat 
Asn Lys Gin Arg Asp Pro Lys Met Ser Cys He Arg Val Thr He Asp 
95 100 105 . HO 

ccg gaa aac aat tta att agt ata tgg aat aat gga aaa ggt att cct 
Pro Glu Asn Asn Leu He Ser He Trp Asn Asn Gly Lys Gly He Pro 
115 120 125 

gtt gtt gaa cac aaa gtt gaa aag atg tat gtc cca get etc ata ttt 
Val Val Glu His Lys Val Glu Lys Met Tyr Val Pro Ala Leu He Phe 
130 135 140 

gga cag etc eta act tct agt aac tat gat gat gat gaa aag aaa gtg 
Gly Gin Leu Leu Thr Ser Ser Asn Tyr Asp Asp Asp Glu Lys Lys Val 
145 150 155 

aca ggt ggt cga aat ggc tat gga gee aaa ttg tgt aac ata ttc agt 

Thr Gly Gly Arg Asn Gly Tyr Gly Ala Lys Leu Cys Asn He Phe Ser 

160 165 170 

» 

acc aaa ttt act gtg gaa aca gec agt aga gaa tac aag aaa atg ttc 

Thr Lys Phe Thr Val Glu Thr Ala Ser Arg Glu Tyr Lys Lys Met Phe 

175 180 185 190 

aaa cag aca tgg atg gat aat atg gga aga get ggt gag atg gaa etc 
Lys Gin Thr Trp Met Asp Asn Met Gly Arg Ala Gly Glu Met Glu Leu 
195 200 205 

aag ccc ttc aat gga gaa gat tat aca tgt ate acc ttt cag cct gat 
Lys Pro Phe Asn Gly Glu Asp Tyr Thr Cys He Thr Phe Gin Pro Asp 
210 215 220 

ttg tct aag ttt aaa atg caa age ctg gac aaa gat att gtt gca eta 
Leu Ser Lys Phe Lys Met Gin Ser Leu Asp Lys Asp He Val Ala Leu 
225 230 235 

atg gtc aga aga gca tat gat att get gga tec acc aaa gat gtc aaa 
Met Val Arg Arg Ala Tyr Asp He Ala Gly Ser Thr Lys Asp Val Lys 
240 245 250 

gtc ttt ctt aat gga aat aaa ctg cca gta aaa gga ttt cgt agt tat 
Val Phe Leu Asn Gly Asn Lys Leu Pro Val Lys Gly Phe Arg Ser Tyr 
255 260 265 270 

gtg gac atg tat ttg aag gac aag ttg gat gaa act ggt aac tec ttg 
Val Asp Met Tyr Leu Lys Asp Lys Leu Asp Glu Thr Gly Asn Ser Leu 
275 280 285 

aaa gta ata cat gaa caa gta aac cac agg tgg gaa gtg tgt tta act 
Lys Val He His Glu Gin Val Asn His Arg Trp Glu Val Cys Leu Thr 
290 295 300 

atg agt gaa aaa ggc ttt cag caa att age ttt gtc aac age att get 
Met Ser Glu Lys Gly Phe Gin Gin He Ser Phe Val Asn Ser He Ala 
305 " 310 315 

aca tec aag ggt ggc aga cat gtt gat tat gta get gat cag att gtg 1128 
Thr Ser Lys Gly Gly Arg His Val Asp Tyr Val Ala Asp Gin He Val 
320 325 330 



552 



600 



648 



696 



744 



792 



840 



888 



936 



984 



1032 



1080 
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act aaa ctt gtt gat gtt gtg aag aag aag aac aag ggt ggt gtt gca 1176 
Thr Lys Leu Val Asp Val Val Lys Lys Lys Asn Lys Gly Gly Val Ala 
335 340 345 350 

gta aaa gca cat cag gtg aaa aat cac atg tgg att ttt gta aat gcc 1224 
Val Lys Ala His Gin Val Lys Asn His Met Trp lie Phe Val Asn Ala 
355 360 365 

tta att gaa aac cca acc ttt gac tct cag aca aaa gaa aac atg act 1272 
Leu He Glu Asn Pro Thr Phe Asp Ser Gin Thr Lys Glu Asn Met Thr 
370 375 • 380 

tta caa ccc aag age ttt gga tea aca tgc caa ttg agt gaa aaa ttt 1320 
Leu Gin Pro Lys Ser Phe Gly Ser Thr Cys Gin Leu Ser Glu Lys Phe 
385 390 • 395 

ate aaa get gcc att ggc tgt ggt att gta gaa age ata eta aac tgg 1368 
He Lys Ala Ala He Gly Cys Gly He Val Glu Ser He Leu Asn Trp 
400 405 410 

gtg aag ttt aag gcc caa gtc cag tta aac aag aag tgt tea get gta 1416 
Val Lys Phe Lys Ala Gin Val. Gin Leu Asn Lys Lys Cys Ser Ala Val 
415 . " 420 425 430 



aaa cat aat aga ate aag gga att ccc aaa etc gat gat gcc aat gat 1464 
Lys His Asn Arg He Lys Gly He Pro Lys Leu Asp Asp Ala Asn Asp 
435 440 445 

gca ggg ggc cga aac tec act gag tgt acg ctt ate ctg act gag gga 1512 
Ala Gly Gly Arg Asn Ser Thr Glu Cys Thr Leu He Leu Thr Glu Gly 
450 455 460 

gat tea gcc aaa act ttg get gtt tea ggc ctt ggt gtg gtt ggg aga 1560 
Asp Ser Ala Lys Thr Leu Ala Val Ser Gly Leu Gly Val Val Gly Arg 
465 470 475 

gac aaa tat ggg gtt ttc cct ctt aga gga aaa ata etc aat gtt cga 1608 
Asp Lys Tyr Gly Val Phe Pro Leu Arg Gly Lys He Leu Asn Val Arg 
480 485 490 

gaa get tct cat aag cag ate atg gaa aat get gag att aac aat ate 1656 
Glu Ala Ser His Lys Gin He Met Glu . Asn Ala Glu He Asn Asn He 
495 500 505 510 

ate aag att gtg ggt ctt cag tac aag aaa aac tat gaa gat gaa gat 1704 
He Lys He Val Gly Leu Gin Tyr Lys Lys Asn Tyr Glu Asp Glu Asp 
515 520 525 

tea ttg aag acg ctt cgt tat ggg aag ata atg att atg aca gat cag 1752 
Ser Leu Lys Thr Leu Arg Tyr. Gly Lys He Met He Met Thr Asp Gin 
530 535 540 

gac caa gat ggt tec cac ate aaa ggc ttg ctg att aat ttt ate cat 1800 
Asp Gin Asp Gly Ser His He Lys Gly Leu Leu He Asn Phe He His 
545 550 555 



cac aac tgg ccc tct ctt ctg cga cat cgt ttt ctg gag gaa ttt ate 1848 

His Asn Trp Pro Ser Leu Leu Arg His Arg Phe Leu Glu Glu Phe He 

560 565 570 

act ccc att gta aag gta tct aaa aac aag caa gaa atg gca ttt tac 1896 

Thr Pro He Val Lys Val Ser Lys Asn Lys Gin Glu Met Ala Phe Tyr 

575 580 585 590 

age ctt cct gaa ttt gaa gag tgg aag agt tct- act cca aat cat aaa 1944 
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Ser Leu Pro Glu Phe Glu Glu Trp Lys Ser Ser Thr Pro Asn His Lys 
595 600 605 

aaa tgg aaa gtc aaa tat tac aaa ggt ttg ggc acc age aca tea aag 1992 

Lys Trp Lys Val Lys Tyr Tyr Lys Gly Leu Gly Thr Ser Thr Ser Lys 

610 615 620 

gaa get aaa gaa tac ttt gca gat atg aaa aga cat cgt ate cag ttc 2040 

Glu Ala Lys Glu Tyr Phe Ala Asp Met Lys Arg His Arg He Gin Phe 
625 630 635 

aaa tat tct ggt cct gaa gat gat get get ate age ctg gee ttt age 2088 

Lys Tyr Ser Gly Pro Glu Asp Asp Ala Ala He Ser Leu Ala Phe Ser 
640 645 650 

aaa aaa cag ata gat gat cga aag gaa tgg tta act aat ttc atg gag 2136 

Lys Lys Gin He Asp Asp Arg Lys Glu. Trp Leu Thr Asn Phe Met Glu 
655 660 665 670 

gat aga aga caa cga aag tta ctt ggg ctt cct gag gat tac ttg tat 2184 
Asp Arg Arg Gin Arg Lys Leu Leu Gly Leu Pro Glu Asp Tyr Leu Tyr 
675 680 685 



gga caa act acc aca tat ctg aca tat aat gac ttc ate aac aag gaa 2232 

Gly Gin Thr Thr Thr Tyr Leu Thr Tyr Asn Asp Phe He Asn Lys Glu 
690 695 700 

ctt ate ttg ttc tea aat tct gat aac gag aga tct ate cct tct atg 2280 

Leu He Leu Phe Ser Asn Ser Asp Asn Glu Arg Ser He Pro Ser Met 
705 710 ' 715 

gtg gat ggt ttg aaa cca ggt cag aga aag gtt ttg ttt act tgc ttc 2328 

Yal Asp Gly Leu Lys Pro Gly Gin Arg Lys Val Leu Phe Thr Cys Phe 
720 725 730 

aaa egg aat gac aag cga gaa gta aag gtt gee caa tta get gga tea 2376 

Lys Arg Asn Asp Lys Arg Glu Val Lys Val Ala Gin Leu Ala Gly Ser 

735 740 745 750 

gtg get gaa atg tct tct tat cat cat ggt gag atg tea eta atg atg 2424 

Val Ala Glu Met Ser Ser Tyr His His Gly Glu Met Ser Leu Met Met 
755 760 765 

acc att ate aat ttg get cag aat ttt gtg ggt age aat aat eta aac 2472 

Thr He He Asn Leu Ala Gin Asn Phe Val Gly Ser Asn Asn Leu Asn 
770 775 780 

etc ttg cag ccc att ggt cag ttt ggt acc agg eta cat ggt ggc aag 2520 

Leu Leu Gin Pro He Gly Gin Phe Gly Thr Arg Leu His Gly Gly Lys 
785 790 795 

gat tct get agt cca cga tac ate ttt aca atg etc age tct ttg get 2568 

Asp Ser Ala Ser Pro Arg Tyr He Phe Thr Met Leu Ser Ser Leu Ala 
800 805 810 



cga ttg tta ttt cca cca aaa gat gat 
Arg Leu Leu Phe Pro Pro Lys Asp Asp 
815 820 

gat gac aac cag cgt gtt gag cct gaa 

Asp Asp Asn Gin Arg Val Glu Pro Glu 
835 

atg gtg ctg ata aat ggt get gaa gga 

Met Val Leu He Asn Gly Ala Glu Gly 

850 855 



cac acg ttg aag ttt tta tat 2616 
His Thr Leu Lys Phe Leu Tyr 
825 830 

tgg tac att cct att att ccc 2664 
Trp Tyr lie Pro He He Pro 
840 845 

ate ggt act ggg tgg tec tgc 2712 
He Gly Thr .Gly Trp Ser Cys 
860 
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aaa ate ccc aac ttt gat gtg cgt gaa att gta aat aac ate agg cgt 
Lys lie Pro Asn Phe Asp Val Arg Glu He Val Asn Asn He Arg Arg 
865 870 875 

ttg atg gat gga gaa gaa 'cct ttg cca atg ctt cca agt tac aag aac 
Leu Met Asp Gly Glu Glu Pro Leu Pro Met Leu Pro Ser Tyr Lys Asn 
880 885 890 

ttc aag ggt act att gaa gaa ctg get cca aat caa tat gtg att agt 
Phe Lys Gly Thr He Glu Glu Leu Ala Pro Asn Gin Tyr Val He Ser 
895 900 905 910 

ggt gaa gta get att ctt aat tct aca acc att gaa ate tea gag ctt 
Gly Glu Val Ala lie Leu Asn Ser Thr Thr He Glu He Ser Glu Leu 
915 920. 925 

ccc gtc aga aca tgg acc cag aca tac aaa gaa caa gtt eta gaa ccc 
Pro Val Arg Thr Trp Thr Gin Thr Tyr Lys Glu Gin .Val Leu Glu Pro 
930 935 940 



atg ttg aat ggc acc gag aag aca cct cct etc ata aca gac tat agg 
Met Leu Asn Gly Thr Glu Lys Thr Pro Pro Leu He Thr Asp Tyr Arg 
945 950 955 

gaa tac cat aca gat acc act gtg aaa ttt gtt gtg aag atg act gaa 
Glu Tyr His Thr Asp Thr Thr Val Lys Phe Val Val Lys Met Thr Glu 
960 965 970 

gaa aaa ctg gca gag gca gag aga gtt gga eta cac aaa gtc ttc aaa 
Glu Lys Leu Ala Glu Ala Glu Arg Val Gly Leu His Lys Val Phe Lys 
975 980 985 990 

etc caa act agt etc aca tgc aac tct atg gtg ctt ttt gac cac gta 
Leu Gin Thr Ser Leu Thr Cys Asn Ser Met Val Leu Phe Asp His Val 
995 1000 1005 

ggc tgt tta aag aaa tat gac acg gtg ttg gat att eta aga gac 
Gly Cys Leu Lys Lys Tyr Asp Thr Val Leu Asp He Leu Arg Asp 
1010 1015 1020 



gtt cca gat gaa gaa gaa aat gaa gag agt gac aac gaa aag gaa 
Val Pro Asp Glu Glu Glu Asn Glu Glu Ser Asp Asn Glu Lys Glu 
1100 H05 1110 

act gaa aag agt gac tec gta aca gat tct gga cca acc ttc aac 



2760 



2808 



2856 



2904 



2952 



3000 



3048 



3096 



3144 



3189 



ttt ttt gaa etc aga ctt aaa tat tat gga tta aga aaa gaa tgg 3234 

Phe Phe Glu Leu Arg Leu Lys Tyr Tyr Gly Leu Arg Lys Glu Trp 
1025 1030 1035 

etc eta gga atg ctt ggt get gaa tct get aaa ctg aat aat cag 3279 

Leu Leu Gly Met Leu Gly Ala Glu Ser Ala Lys Leu Asn Asn Gin 
1040 1045 1050 

get cgc ttt ate tta gag aaa ata gat ggc aaa ata ate att gaa 

Ala Arg Phe He Leu Glu Lys He Asp Gly Lys lie He He Glu 
1055 1060 1065 



3324 



aat aag cct aag aaa gaa tta att aaa gtt ctg att cag agg gga 3369 

Asn Lys Pro Lys Lys Glu Leu He Lys Val Leu He Gin Arg Gly 
1070 1075 1080 
t 

tat gat teg gat cct gtg aag gee tgg aaa gaa gee cag caa aag 3414 

Tyr Asp Ser Asp Pro Val Lys' Ala Trp Lys Glu Ala Gin Gin Lys 
1085 1090 1095 



3459 



3504 
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Thr Glu Lys Ser Asp Ser Val Thr Asp Ser Gly Pro Thr Phe Asn 
1115 1120 1125 



tat ctt ctt gat atg ccc ctt tgg tat tta acc aag gaa aag aaa 
Tyr Leu Leu Asp Met Pro Leu Trp Tyr Leu Thr Lys Glu Lys Lys 
1130 1135 1140 



3549 



gat gaa etc tgc 
Asp Glu Leu Cys 
1145 



agg eta aga aat gaa aaa gaa caa gag ctg gac 
Arg Leu Arg Asn Glu Lys Glu Gin Glu Leu Asp 
1150 1155 



3594 



aca tta aaa aga aag agt cca tea gat 
Thr Leu Lys Arg Lys Ser Pro Ser Asp 
1160 "* 1165 



ttg tgg aaa gaa gac ttg 
Leu Trp Lys Glu Asp Leu 
1170 



3639 



get aca ttt att 
Ala Thr Phe He 
1175 



gaa gaa ttg gag get gtt gaa gee aag gaa aaa 
Glu Glu Leu Glu Ala Val Glu Ala Lys Glu Lys 
1180 , 1185 



3684 



caa gat gaa caa gtc gga ctt cct ggg 
Gin Asp Glu Gin Val Gly Leu Pro Gly 
1190 1195 



aaa ggg ggg aag gee aag 
Lys Gly Gly Lys Ala Lys 
1200 



3729 



ggg aaa aaa aca caa atg get gaa gtt ttg cct tct ccg cgt ggt 
Gly Lys Lys Thr Gin Met Ala Glu Val Leu Pro Ser Pro Arg Gly 
1205 1210 1215 



3774 



caa aga gtc att cca cga ata acc ata 
Gin Arg Val He Pro Arg He Thr He 
1220 1225 



gaa atg aaa gca gag gca 
Glu Met Lys Ala Glu Ala 
1230 



3819 



gaa aag aaa aat aaa aag aaa att aag aat gaa aat act gaa gga 
Glu Lys Lys Asn Lys Lys Lys He Lys Asn Glu Asn Thr Glu Gly 
1235 1240 1245 



3864 



age cct caa gaa gat ggt gtg gaa eta gaa ggc eta aaa caa aga 
Ser Pro Gin Glu Asp Gly Val Glu Leu Glu Gly Leu Lys Gin Arg 
1250 ~ 1255 1260 



3909 



tta gaa aag aaa cag aaa aga gaa cca" ggt aca aag aca aag aaa 
Leu Glu Lys Lys Gin Lys Arg Glu Pro Gly Thr Lys Thr Lys Lys 
1265 1270 1275 



3954 



caa act aca ttg gca ttt aag cca ate aaa aaa gga aag aag aga 
Gin Thr Thr Leu Ala Phe Lys Pro He Lys Lys Gly Lys Lys Arg 
1280 1285 1290 



3999 



aat ccc tgg tct gat tea gaa tea gat agg age agt gac gaa agt 
Asn Pro Trp Ser Asp Ser Glu Ser Asp Arg Ser Ser Asp Glu Ser 
1295 1300 1305 



4044 



aat ttt gat gtc cct cca cga gaa aca gag cca egg aga gca gca 4089 

Asn Phe Asp Val Pro Pro Arg Glu Thr Glu Pro Arg Arg Ala Ala 
1310 1315 1320 

aca aaa aca aaa ttc aca atg gat ttg gat tea gat gaa gat ttc 4134 

Thr Lys Thr Lys Phe Thr Met Asp Leu Asp Ser Asp Glu Asp Phe 
1325 1330 1335 

tea gat ttt gat gaa aaa act gat gat gaa gat ttt gtc cca tea 4179 

Ser Asp Phe Asp Glu Lys Thr Asp Asp Glu Asp Phe Val Pro Ser 
1340 1345 1350 

gat get agt cca cct aag acc aaa act tec cca aaa ctt agt aac 4224 

Asp Ala Ser Pro Pro Lys Thr Lys Thr Ser. Pro Lys Leu Ser Asn 
1355 1360 1365 
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aaa gaa ctg aaa cca cag aaa agt gtc gtg tea gac ctt gaa get 
Lys Glu Leu Lys Pro Gin LysSer Val Val Ser Asp Leu Glu Ala 
1370 1375 . 1380 

gat gat gtt aag ggc agt gta cca ctg tct tea age cct cct get 
Asp Asp Val Lys Gly Ser Val Pro Leu Ser Ser Ser Pro Pro Ala 
1385 1390 1395 

aca cat ttc cca gat gaa act gaa att aca aac cca gtt cct aaa 
Thr His Phe Pro Asp Glu Thr Glu lie Thr Asn. Pro Val Pro Lys 
1400 1405 1410 

aag aat gtg aca gtg aag aag aca gca gca aaa agt cag tct tec 
Lys Asn Val Thr Val Lys Lys Thr Ala Ala Lys Ser Gin Ser Ser 
1415 1420 1425 

acc tec act acc ggt gee aaa aaa agg get gee cca aaa gga act 
Thr Ser Thr Thr Gly Ala Lys Lys Arg ' Ala Ala Pro Lys Gly Thr 
1430 1435 1440 

aaa agg gat cca get ttg aat tct ggt gtc tct caa aag cct gat 
Lys Arg Asp Pro Ala Leu Asn Ser Gly Val Ser Gin Lys Pro Asp 
1445 1450 . 1455 

cct gee aaa acc aag aat cgc cgc aaa agg aag cca tec act tct 
Pro Ala Lys Thr Lys Asn Arg Arg Lys Arg Lys Pro Ser Thr Ser 
1460 1465 1470 

gat gat tct gac tct aat ttt gag aaa att gtt teg aaa gca gtc 
Asp Asp Ser Asp Ser Asn Phe Glu Lys He Val Ser Lys Ala Val 
1475 1480 1485 

aca age aag aaa tec aag ggg gag agt gat gac ttc cat atg gac 
Thr Ser Lys Lys Ser Lys Gly Glu Ser Asp Asp Phe His Met Asp 
1490 1495 1500 

ttt gac tea get gtg get cct egg gca aaa tct gta egg gca aag 
Phe Asp Ser Ala Val Ala Pro Arg Ala Lys Ser Val Arg Ala Lys 
1505 1510 1515 

aaa cct ata aag tac ctg gaa gag tea gat gaa gat gat ctg ttt 
Lys Pro He Lys Tyr Leu Glu Glu Ser Asp Glu Asp Asp Leu Phe 
1520 1525 1530 



4269 



4314 



4359 



4404 



4449 



4494 



4539 



4584 



4629 



4674 



4719 



4779 
4839 
4899 



taaaatgtga ggegattatt ttaagtaatt atcttaccaa gcccaagact ggttttaaag 
ttacctgaag ctcttaactt cctcccctct gaatttagtt tggggaaggt gtttttagta 
caagacatca aagtgaagta aagcccaagt gttctttagc tttttataat actgtctaaa 
tagtgaccat ctcatgggca ttgttttctt etctgetttg tctgtgtttt gagtctgett 4959 
tcttttgtct ttaaaacctg atttttaagt tcttctgaac tgtagaaata gctatctgat 5019 
cacttcagcg taaagcagtg tgtttattaa ccatccacta agctaaaact agagcagttt 5079 
gatttaaaag tgtcactctt cctccttttc tactttcagt agatatgaga tagagcataa 5139 
ttatctgttt tatcttagtt ttatacataa tttaccatca gatagaactt tatggttcta 5199 
gtacagatac tctactacac tcagcctctt atgtgccaag tttttcttta agcaatgaga 5259 
aattgetcat gttcttcatc ttctcaaatc ateagaggee aaagaaaaac actttggctg 5319 
tgtctataac ttgacacagt caa'tagaatg aagaaaatta gagtagttat gtgattattt 



5379 
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cagctcttga cctgtcccct ctggctgcct ctgagtctga atctcccaaa gagagaaacc 5439 

aatttctaag aggactggat tgcagaagac tcggggacaa catttgatcc aagatcttaa 5499 

atgttatatt gataaccatg ctcagcaatg agctattaga ttcattttgg gaaatctcca 5559 

taatttcaat ttgtaaactt tgttaagacc tgtctacatt gttatatgtg tgtgacttga 5619 

gtaatgttat caacgttttt gtaaatattt actatgtttt tctattagct aaattccaac 5679 

aattttgtac tttaataaa 5698 

. <210> 30 

<211> 1531 

<212> PRT 

<213> NM 001067 T0P2A 



<400>- 30 

Met Glu Val Ser Pro Leu Gin Pro Val Asn Glu Asn Met Gin Val Asn 
1 5 10 15 



Lys lie Lys Lys Asn Glu Asp Ala Lys Lys Arg Leu Ser Val Glu Arg 
20 25 ' 30 



lie Tyr Gin Lys Lys Thr Gin Leu Glu His lie Leu Leu Arg Pro Asp 
35 " 40 45 



Thr Tyr lie Gly Ser Val Glu Leu Val Thr Gin Gin Met Trp Val Tyr 
50 55 60 



Asp Glu Asp Val Gly lie Asn Tyr Arg Glu Val Thr Phe Val Pro Gly 
65 70 75 80 



Leu Tyr Lys lie Phe Asp Glu lie Leu Val Asn Ala Ala Asp Asn Lys 
85 90 ' 95 



Gin Arg Asp Pro Lys Met Ser Cys lie Arg Val Thr lie Asp Pro Glu 
100 105 110 



Asn Asn Leu lie Ser lie Trp Asn Asn Gly Lys Gly lie Pro Val Val 
115 120 ~* 125 



Glu His Lys Val Glu . Lys Met Tyr Val Pro Ala Leu lie Phe Gly Gin 
130 135 140 



Leu Leu Thr Ser Ser Asn Tyr Asp Asp Asp Glu Lys Lys Val Thr Gly 
145 150 155 160 



Gly Arg Asn Gly Tyr Gly Ala Lys Leu Cys Asn He Phe Ser Thr Lys 
165 170 175 
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Phe Thr Val Glu Thr Ala Ser Arg Glu Tyr Lys Lys Met Phe Lys Gin 
180 185 190 



Thr Trp Met Asp Asn Met Gly Arg Ala Gly Glu Met Glu Leu Lys Pro 
195 200 205 



Phe Asn Gly Glu Asp Tyr Thr Cys He Thr Phe Gin Pro Asp Leu Ser 
210 215 220 



Lys Phe Lys Met Gin Ser Leu Asp Lys Asp He Val Ala Leu Met Val 
225 230 235 240 



Arg Arg Ala Tyr Asp He Ala Gly Ser Thr Lys Asp Val Lys Val Phe 
245 250 ' 255 



Leu Asn Gly Asn Lys Leu Pro Val Lys Gly Phe Arg Ser Tyr Val Asp 
260 265 270 



Met Tyr Leu Lys Asp Lys Leu Asp Glu Thr Gly Asn Ser Leu Lys Val 
275 280 285 



He His Glu Gin Val Asn His Arg Trp Glu Val Cys Leu Thr Met Ser 
290 295 300 



Glu Lys Gly Phe Gin Gin He Ser Phe Val Asn Ser He Ala Thr Ser 
305 310 315 320 



Lys Gly Gly Arg His Val Asp Tyr Val Ala Asp Gin He Val Thr Lys 
325 330 335 



Leu Val Asp Val Val Lys Lys Lys Asn Lys Gly Gly Val Ala Val Lys 
340 345 350 



Ala His Gin Val Lys Asn His Met Trp He Phe Val Asn Ala Leu He 
355 360 365 



Glu Asn Pro Thr Phe Asp Ser Gin Thr Lys Glu Asn Met Thr Leu Gin 
370 375 380 



Pro Lys Ser Phe Gly Ser Thr Cys Gin Leu Ser Glu Lys Phe He Lys 
385 390 395 400 



Ala Ala He Gly Cys Gly He Val Glu Ser He Leu Asn Trp Val Lys 
405 410 415 



Phe Lys Ala Gin Val Gin Leu Asn Lys Lys Cys Ser Ala Val Lys His 
420 425 430 



Asn Arg He Lys Gly He Pro Lys Leu Asp Asp Ala Asn Asp Ala Gly 
435 440 ' 445 
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Gly Arg Asn Ser Thr Glu Cys Thr Leu lie Leu Thr Glu Gly Asp Ser 
450 455 '460 



Ala Lys Thr Leu Ala Val Ser Gly Leu Gly Val Val Gly Arg Asp Lys 
465 470 475 480 



Tyr Gly Val Phe Pro Leu Arg Gly Lys lie Leu Asn Val Arg Glu Ala 
485 490 495 



Ser His Lys Gin lie Met Glu Asn Ala Glu lie Ash Asn lie lie Lys 
500 505 510 



lie Val Gly Leu Gin Tyr Lys Lys Asn Tyr Glu Asp Glu Asp Ser Leu 
515 520 525 



Lys Thr Leu Arg Tyr Gly Lys lie Met lie Met Thr Asp Gin Asp Gin 
530 535 540 



Asp Gly Ser His lie Lys Gly Leu Leu lie Asn Phe lie His His Asn 
545 550 555 560 



Trp Pro Ser Leu Leu Arg His Arg Phe Leu Glu Glu Phe lie Thr Pro 
565 570 575 



He Val Lys Val Ser Lys Asn Lys Gin Glu Met Ala Phe Tyr Ser Leu 
580 585 590 



Pro Glu Phe Glu Glu Trp Lys Ser Ser Thr Pro Asn His Lys Lys Trp 
595 600 605 



Lys Val Lys Tyr Tyr Lys Gly Leu Gly Thr Ser Thr Ser Lys Glu Ala 
610 615 620 



Lys Glu Tyr Phe Ala Asp Met Lys Arg His Arg He Gin Phe Lys Tyr 
625 630 635 640 



Ser Gly Pro Glu Asp Asp Ala Ala He Ser Leu Ala . Phe Ser Lys Lys 
645 650 655 



Gin He Asp Asp Arg Lys Glu Tfp Leu Thr Asn Phe Met Glu Asp Arg 
660 665 670 



Arg Gin Arg Lys Leu Leu Gly Leu Pro Glu Asp Tyr Leu Tyr Gly Gin 
675 680 685 



Thr Thr Thr Tyr Leu Thr Tyr Asn Asp Phe He Asn Lys Glu Leu He 
690 695 700 



Leu Phe Ser Asn Ser Asp Asn Glu Arg Ser He Pro Ser Met Val Asp 
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705 



710 



715 



720 



Gly Leu Lys Pro Gly Gin Arg Lys Val Leu Phe Thr Cys Phe Lys Arg 
725 730 " 735 



Asn Asp Lys Arg Glu Val Lys Val Ala Gin Leu Ala Gly Ser Val Ala 
740 745 750 



Glu Met Ser Ser Tyr His His Gly Glu Met Ser Leu Met Met Thr lie 
755 760 765 

He Asn Leu Ala Gin Asn Phe Val Gly Ser Asn Asn Leu Asn Leu Leu 
770 775 780 



Gin Pro He Gly Gin Phe Gly Thr Arg Leu His Gly Gly Lys Asp Ser 
785 790 795 " 800 

Ala Ser Pro Arg Tyr He Phe Thr Met Leu Ser Ser Leu Ala Arg Leu 
805 810 815 



Leu Phe Pro Pro Lys Asp Asp His Thr Leu Lys Phe Leu Tyr Asp Asp 
820 825 830 



Asn Gin Arg Val Glu Pro Glu Trp Tyr He Pro He He Pro Met Val 
835 840 845 



Leu He Asn Gly Ala Glu Gly He Gly Thr Gly Trp Ser Cys Lys He 
850 855 860 



Pro Asn Phe Asp Val Arg Glu He Val Asn Asn He Arg Arg Leu Met 
865 870 875 880 



Asp Gly Glu Glu Pro Leu Pro Met Leu Pro Ser Tyr Lys Asn Phe Lys 
885 890 * 895 

Gly Thr He Glu Glu Leu Ala Pro Asn Gin Tyr Val He Ser Gly Glu 
900 905 910 



Val Ala He Leu Asn Ser Thr Thr He Glu lie Ser Glu Leu Pro Val 
915 920 925 



Arg Thr Trp Thr Gin Thr Tyr Lys Glu Gin Val Leu Glu Pro Met Leu 
930 935 940 



Asn Gly Thr Glu Lys Thr Pro Pro Leu He Thr Asp Tyr Arg Glu Tyr 
945 950 955 " 960 



His Thr Asp Thr Thr Val Lys Phe Val Val Lys Met Thr Glu Glu Lys 
965 970 975 



Leu Ala Glu Ala Glu Arg Val Gly Leu His Lys Val Phe Lys Leu Gin 
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980 985 990 



Thr Ser Leu Thr Cys Asn Ser Met Val Leu Phe Asp His Val Gly Cys 
995 1000 1005 



Leu Lys Lys Tyr Asp Thr Val Leu Asp He Leu Arg Asp Phe Phe 
1010 1015 1020 



Glu Leu Arg Leu Lys Tyr Tyr Gly Leu Arg Lys Glu Trp Leu Leu 
1025 1030 1035 



Gly Met Leu Gly Ala Glu Ser Ala Lys Leu Asn Asn Gin Ala Arg 
1040 1045 1050 



Phe He Leu Glu Lys He Asp Gly Lys He He He Glu Asn Lys 
1055 1060 1065 



Pro Lys Lys Glu Leu He Lys Val Leu He Gin Arg Gly Tyr Asp 
1070 1075 1080 



Ser Asp Pro Val Lys Ala Trp Lys Glu Ala Gin Gin Lys Val Pro 
1085 1090 1095 



Asp Glu Glu Glu Asn Glu Glu Ser Asp Ash Glu Lys Glu Thr Glu 
1100 H05 mo 



Lys Ser Asp Ser Val Thr Asp Ser Gly Pro Thr Phe Asn Tyr Leu 
1115 1120 1125 



Leu Asp Met Pro Leu Trp Tyr Leu Thr Lys Glu Lys Lys Asp Glu 
1130 H35 1140 



Leu Cys Arg Leu Arg Asn Glu Lys Glu Gin Glu Leu Asp Thr Leu 
1145 H50 1155 



Lys Arg Lys Ser Pro Ser Asp Leu Trp Lys Glu Asp Leu Ala Thr 
1160 H65 1170 



Phe He Glu Glu Leu Glu Ala Val Glu Ala Lys Glu Lys Gin Asp 
1175 H80 * 1185 



Glu Gin Val Gly' Leu Pro Gly Lys Gly Gly Lys Ala Lys Gly Lys 
1190 H95 1200 



Lys Thr Gin Met Ala Glu Val Leu Pro Ser Pro Arg Gly Gin Arg 
1205 1210 1215 



Val He Pro Arg He Thr He Glu Met Lys Ala Glu Ala Glu Lys 
1220 1225 1230 



Lys Asn Lys Lys Lys He Lys Asn Glu Asn Thr Glu Gly Ser Pro 



WO 2004/022778 PCT/AU2003/001166 

80 

1235 1240 1245 

Gin Glu Asp Gly Val Glu Leu Glu Gly Leu Lys Gin Arg Leu Glu 
1250 1255 1260 

Lys Lys Gin Lys Arg Glu Pro Gly Thr. Lys Thr Lys Lys Gin Thr 
1265 "* " 1270 • 1275 

Thr Leu Ala Phe Lys Pro lie Lys Lys Gly Lys Lys Arg Asn Pro 
1280 1285 1290 

Trp Ser Asp Ser Glu Ser Asp Arg Ser Ser Asp Glu Ser Asn Phe 
1295 1300 1305 

Asp Val Pro Pro Arg Glu Thr Glu Pro Arg Arg Ala Ala Thr Lys 
1310 1315 1320 

Thr Lys Phe Thr Met Asp Leu Asp Ser Asp Glu Asp Phe Ser Asp 
1325 1330 1335 

Phe Asp Glu Lys Thr Asp Asp Glu Asp Phe Val Pro Ser Asp Ala 
1340 1345 1350 



Ser Pro Pro Lys Thr Lys Thr Ser Pro Lys Leu Ser Asn Lys Glu 
1355 1360 1365 



Leu Lys Pro Gin Lys Ser Val Val Ser Asp Leu Glu Ala Asp Asp 
1370 1375 1380 



Val Lys Gly Ser Val Pro Leu Ser Ser Ser Pro Pro Ala Thr His 
1385 1390 1395 



Phe Pro Asp Glu Thr Glu lie Thr Asn Pro Val Pro Lys Lys Asn 
1400 1405 1410 



Val Thr Val Lys Lys Thr Ala Ala Lys Ser Gin Ser Ser Thr Ser 
1415 1420 1425 



Thr Thr Gly Ala Lys Lys Arg Ala Ala Pro Lys Gly Thr Lys Arg 
1430 - 1435 1440 



Asp Pro Ala Leu Asn Ser Gly Val Ser Gin Lys Pro Asp Pro Ala 
1445 1450 1455 



Lys Thr Lys Asn Arg Arg Lys Arg Lys Pro Ser Thr Ser Asp Asp 
1460 1465 1470 



Ser Asp Ser Asn Phe Glu Lys lie Val Ser Lys Ala Val Thr Ser 
1475 1480 1485 
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Lys Lys Ser Lys Gly Glu Ser Asp Asp Phe His Met Asp Phe Asp 
1490 1495 1500 



Ser Ala Val Ala Pro Arg Ala Lys Ser Val Arg Ala Lys Lys Pro 
1505 1510 1515 



lie Lys Tyr Leu. Glu Glu Ser Asp Glu Asp Asp Leu Phe 
1520 * 1525 1530 



<210> 31 

<211> 1369 

<212> DNA 

<213> NMJL38410 chemokine-like factor super family 7, CKLFSF7 
<220> 

<221> CDS 

<222> (237) . . (761) 

<223> 



<400> 31 

agtctaactt ccgcgcatct acgagggccg ggactgccgc tacctttctg gaaggcgccg 60 

gccggccagt caccggaggg atcccgccag aggcggctcc gcgtctccag cagcggggca 120 

gggacccggg cgccccgccc tcgccagcgc ccgcccccgc ccgcccccgg cccgccctct 180 

gtatctggcc cctgggcagc tgcccgggga ggcggccagc gagctggggc cgcgca atg 239 

Met 
1 

teg cac gga gec ggg etc gtc cgc acc acg tgc age age ggc age gcg 287 
Ser His Gly Ala Gly Leu Val Arg Thr Thr Cys Ser Ser Gly Ser Ala 
5 10 15 

etc gga ccc ggg gee ggc gcg gec cag ccc age gcg age ccc ttg gag 335 
Leu Gly Pro Gly Ala Gly Ala Ala Gin Pro Ser Ala Ser Pro Leu Glu 
20 25 30 

ggg ctg ctg gac etc age tac ccc cgc acc cac gcg gec ctg ctg aaa 383 
Gly Leu Leu Asp Leu Ser Tyr Pro Arg Thr His Ala Ala Leu Leu Lys 
35 40 45 

gtg gcg caa atg gtc acc ctg ctg att gee ttc ate tgt gtg egg age 431 
Val Ala Gin Met Val Thr Leu Leu He Ala Phe He Cys Val Arg Ser 
50 55 60 65 

tec ctg tgg acc aac tac age gec tac age tac ttt gaa gtg gtc acc 479 
Ser Leu Trp Thr Asn Tyr Ser Ala Tyr Ser Tyr Phe Glu Val Val Thr 
70 75 80 

att tgc gac ttg ata atg ate etc gee ttt tac ctg gtc cac etc ttc 527 
He Cys Asp Leu He Met He Leu Ala Phe Tyr Leu Val His Leu Phe 
85 90 95 

cgc ttc tac cgc gtg etc acc tgt ate age tgg ccc ctg teg gaa ctt 575 
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Arg Phe Tyr Arg Val Leu Thr Cys He Ser Trp Pro Leu Ser Glu Leu 

100 105 110 

ctg cac tat tta ate ggt acc ctg etc etc etc ate gee tec att gtg 623 

Leu His Tyr Leu He Gly Thr Leu Leu Leu Leu He Ala Ser He Val 

115 120 125 

gca get tec aag agt tac aae cag age gga ctg gta gee gga gcg ate 671 

Ala Ala Ser Lys Ser Tyr Asn Gin Ser Gly Leu Val Ala Gly Ala He 

130 135 140 145 

ttt ggt ttc atg gee acc ttc etc tgc atg gca age ata tgg ctg tec 719 

Phe Gly Phe Met Ala Thr Phe Leu Cys Met Ala Ser He Trp Leu Ser 

150 155 160 

tat aag ate teg tgt gta acc cag tec aca gat gca gee gtc 761 

Tyr Lys He Ser Cys Val Thr Gin Ser Thr Asp Ala Ala Val 





165 


170 


175 




tgatgaggee 


acaaccccta 


ggcccctcag 


gagctttgea 


gagaggagga cgtgtactcc 


821 


aggegaggee 


tctggacctg 


tgttcctgtg 


ccaaagtcct 


gtcaggctgg tgggcaccag 


881 


gaaaggcctg 


caccctcttc 


ctgctctccc 


aggaagecag 


ctccctgagc tcctgagcca 


941 


geeggaaact 


cttcctccag 


ccttccgggg 


agaacatccc 


tcccattctg ggaaaggaaa 


1001 


gcagcctcca 


gggaaatgtt 


ttctgccttc 


ctgettctag 


aaccacctca ggtactgatg 


1061 


aaccccactt 


agcacagctg 


aaggggtttg 


tgaatactcc 


cgcctaaatc ccttctactt 


1121 


cactcctcag 


gggagtgaag 


tgecttaaga 


aacaaagccc 


tgtcctaatt tatctagctt 


1181 


gtcagtcegg 


tcttagagat 


accctctttc 


ctgaagtgag 


gcgtgcctgt agaaacacta 


1241 


tgtggtcagc 


ctgtccccaa 


ggagatcttg 


tgtctcctct 


ccatctctgc ctttgttacc 


1301 


agtgtgcatg 


tgtttgtgtg 


ttttttaata 


aaatattgac 


teggecagtt aaaaaaaaaa 


1361 


aaaaaaaa 










1369 



<210> 32 

<211> 175 

<212> PRT 

<213> NM_138410 chemokine-liXe factor super family 7, CKLFSF7 



<400> 32 

Met Ser His Gly Ala Gly Leu Val Arg Thr Thr Cys Ser Ser Gly Ser 
15 10 15 



Ala Leu Gly Pro Gly Ala Gly Ala Ala Gin Pro Ser Ala Ser Pro Leu 
20 25 30 



Glu Gly Leu Leu Asp Leu Ser Tyr Pro Arg Thr His Ala Ala Leu Leu 
35 40 45 



Lys Val Ala Gin Met Val Thr Leu Leu He Ala Phe He Cys Val Arg 
50 55 60 
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Ser Ser Leu Trp Thr Asn Tyr Ser Ala Tyr Ser Tyr Phe Glu Val Val 
65 70 75 . 80 



Thr lie Cys Asp Leu He Met He Leu Ala Phe Tyr Leu Val His Leu 
85 90 95 



Phe Arg Phe Tyr Arg Val Leu Thr Cys lie Ser Trp Pro Leu Ser Glu 
100 " 105 110 



Leu Leu His Tyr Leu He Gly Thr Leu Leu Leu Leu He Ala Ser He 
115 120 125 



Val Ala Ala Ser Lys Ser Tyr Asn Gin Ser Gly Leu Val Ala Gly Ala 
• 130 135 140. 



He Phe Gly Phe Met Ala Thr Phe Leu Cys Met Ala Ser He Trp Leu 
145 150 155 160 



Ser Tyr Lys He Ser Cys Val Thr Gin Ser Thr Asp Ala Ala Val 
165 170 175 

<210> 33 
<211> 3470 
<212> DNA 

<213> NM_012112 C20orfl, chromosome' 20 open reading frame 1 



<220> 

<221> CDS 

<222> (458) . . (2698) 

<223> 



<400> 33 

ggctaggtga gccgtgggaa gaaaagaggg agcagctagg gcgcgggtct ccctcctccc 60 

ggagtttgga acggctgaag ttcaccttcc agcccctagc gccgttcgcg ccgctaggcc 120 

tggcttctga ggcggttgcg gtgctcggtc gccgcctagg cggggcaggg tgcgagcagg 180 

ggcttcgggc cacgcttctc ttggcgacag gattttgctg tgaagtccgt ccgggaaacg 240 

gaggaaaaaa agagttgcgg gaggctgtcg gctaataacg gttcttgata catatttgcc 300 

agacttcaag atttcagaaa aggggtgaaa gagaagattg caactttgag tcagacctgt 360 

aggcctgata gactgattaa accacagaag gtgacctgct gagaaaagtg gtacaaatac 420 

tgggaaaaac ctgctcttct gcgttaagtg ggagaca atg tea caa gtt aaa age 475 

Met Ser Gin Val Lys Ser 
1 5 
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tct tat tec tat gat gec ccc teg gat ttc ate aat ttt tea tec ttg 523 
Ser Tyr Ser Tyr Asp Ala Pro Ser Asp Phe lie Asn Phe Ser Ser Leu 
10 15-20 

gat gat gaa gga gat act caa aac ata gat tea tgg ttt gag gag aag 571 
Asp Asp Glu Gly Asp Thr Gin Asn He Asp Ser Trp Phe Glu Glu Lys 
25 30 35 

gee aat ttg gag aat aag tta ctg ggg aag aat gga act gga ggg ctt 619 
Ala Asn Leu Glu Asn Lys Leu Leu Gly Lys Asn Gly Thr Gly Gly Leu 
40 .45 , 50 

ttt cag ggc aaa act cct ttg aga aag get aat ctt cag caa get att 667 
Phe Gin Gly Lys Thr Pro Leu Arg Lys Ala Asn Leu Gin Gin Ala He 
55 60 65 70 

gtc aca cct ttg aaa cca gtt gac aac act tac tac aaa gag gca gaa 715 
Val Thr Pro Leu Lys Pro Val Asp Asn Thr Tyr Tyr Lys Glu Ala Glu 
75 80 85 

aaa gaa aat ctt gtg gaa caa tec att ccg tea aat get tgt tct tec 763 
Lys Glu Asn Leu Val Glu Gin Ser He Pro Ser Asn Ala Cys Ser Ser 
90 95 100 

ctg gaa gtt gag gca gee ata tea aga aaa act cca gee cag cct cag 811 
Leu Glu Val Glu Ala Ala He Ser Arg Lys Thr Pro Ala Gin Pro Gin 
105 HO 115 

aga aga tct ctt agg ctt tct get cag aag gat ttg gaa cag aaa gaa 859 
Arg Arg Ser Leu Arg Leu Ser Ala Gin Lys Asp Leu Glu Gin Lys Glu 
120 125 130 

aag cat cat gta aaa atg aaa gee aag aga tgt gec act cct gta ate 907 
Lys His His Val Lys Met Lys Ala Lys Arg Cys Ala Thr Pro Val He 
135 140 145 150 

ate gat gaa att eta ccc tct aag aaa atg aaa gtt tct aac aac aaa 955 
He Asp Glu He Leu Pro Ser Lys Lys Met Lys Val Ser Asn Asn Lys 
155 160 165 

aag aag cca gag gaa gaa ggc agt get cat caa gat act get gaa aag 1003 
Lys Lys Pro Glu Glu Glu Gly Ser Ala His Gin Asp Thr Ala Glu Lys 
170 175 180 

aat gca tct tec cca gag aaa gee aag ggt aga cat act gtg cct tgt 1051 
Asn Ala Ser Ser Pro Glu Lys Ala Lys Gly Arg His Thr Val Pro Cys 
185 190 195 



atg cca cct gca aag cag aag ttt eta aaa agt act gag gag caa gag 
Met Pro Pro Ala Lys Gin Lys Phe Leu Lys Ser Thr Glu Glu Gin Glu 
200 205 210 



1099 



ctg gag aag agt atg aaa atg cag caa gag gtg gtg gag atg egg aaa 1147 
Leu Glu Lys Ser Met Lys Met Gin Gin Glu Val Val Glu Met Arg Lys 
215 220 225 230 

aag aat gaa gaa ttc aag aaa ctt get ctg get gga ata ggg caa cct 1195 
Lys Asn Glu Glu Phe Lys Lys Leu Ala Leu Ala Gly He Gly Gin Pro 
235 240 245 

gtg aag aaa tea gtg age cag gtc acc aaa tea gtt gac ttc cac ttc 1243 
Val Lys Lys Ser Val Ser Gin Val Thr Lys Ser Val Asp Phe His Phe 
250 255 260 

cgc aca .gat gag cga ate aaa caa cat cct aag aac cag gag gaa tat 1291 
Arg Thr Asp Glu Arg He Lys Gin His Pro Lys Asn Gin Glu Glu Tyr 
265 270 275 
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aag gaa gtg aac ttt aca tct gaa eta cga aag cat cct tea tct cct 1339 
Lys Glu Val Asn Phe Thr Ser Glu Leu Arg Lys His Pro Ser Ser Pro 
280 285 290 

gec cga gtg act aag gga tgt acc att gtt aag cct ttc aac ctg tec 1387 
Ala Arg Val Thr Lys Gly Cys Thr lie Val Lys Pro Phe Asn Leu Ser 
295 300 305 310 

caa gga aag aaa aga aca ttt gat gaa aca gtt tct aca tat gtg ccc 1435 
Gin Gly Lys Lys Arg Thr Phe Asp Glu Thr Val Ser Thr Tyr Val Pro 
315 320 325 

ctt gca cag caa gtt gaa gac ttc cat aaa cga acc cct aac aga tat 1483 
Leu Ala Gin Gin Val Glu Asp Phe His Lys Arg Thr Pro Asn Arg Tyr 
330 335 340 

cat ttg agg age aag aag gat gat att aac ctg tta ccc tec aaa tct 1531 
His Leu Arg Ser Lys Lys Asp Asp lie Asn Leu Leu Pro Ser Lys Ser 
345 350 355 

tct gtg acc aag att tgc aga gac cca cag act cct gta ctg caa acc 1579 
Ser Val Thr Lys lie Cys Arg Asp Pro Gin Thr Pro Val Leu Gin Thr 
360 365 370 

aaa cac cgt gca egg get gtg acc tgc aaa agt aca gca gag ctg gag 1627 
Lys His Arg Ala Arg Ala Val Thr Cys Lys Ser Thr Ala Glu Leu Glu 
375 380 - 385 390 

get gag gag etc gag aaa ttg caa caa tac aaa ttc aaa gca cgt gaa 1675 
Ala Glu Glu Leu Glu Lys Leu Gin Gin Tyr Lys Phe Lys Ala Arg Glu 
395 400 405 

ctt gat ccc aga ata ctt gaa ggt ggg ccc ate ttg ccc aag aaa cca 1723 
Leu Asp Pro Arg lie Leu Glu Gly Gly Pro lie Leu Pro Lys Lys Pro 
410 415 420 

cct gtg aaa cca ccc acc gag cct att ggc ttt gat ttg gaa att gag 1771 
Pro Val Lys Pro Pro Thr Glu Pro lie Gly Phe Asp Leu Glu lie Glu 
425 430 ■ " 435 

aaa aga ate cag gag cga gaa tea aag aag aaa aca gag gat gaa cac 1819 
Lys Arg He Gin Glu Arg Glu Ser Lys Lys Lys Thr Glu Asp Glu His 
440 445 450 

ttt gaa ttt cat tec aga cct tgc cct act aag att ttg gaa gat gtt 1867 
Phe Glu Phe His Ser Arg Pro Cys Pro Thr Lys He Leu Glu Asp Val 
455 460 465 470 

gtg ggt gtt cct gaa aag aag gta ctt cca. ate acc gtc ccc aag tea 1915 
Val Gly Val Pro Glu Lys Lys Val Leu Pro He Thr Val Pro Lys Ser 
475 480 485 

cca gee ttt gca ttg aag aac aga att cga atg ccc acc aaa gaa gat 1963 
Pro Ala Phe Ala Leu Lys Asn Arg He Arg Met Pro Thr Lys Glu Asp 
490 495 500 

gag gaa gag gac gaa ccg gta gtg ata aaa get caa cct gtg cca cat 2011 
Glu Glu Glu Asp Glu Pro Val Val He Lys Ala Gin Pro Val Pro His 
505 510 515 

tat ggg gtg cct ttt aag ccc caa ate cca gag gca aga act gtg gaa 2059 
Tyr Gly Val Pro Phe Lys Pro Gin He Pro Glu Ala Arg Thr Val Glu 
520 525 530 

ata tgc cct ttc teg ttt gat tct cga gac aaa gaa cgt cag tta cag 2107 
He Cys Pro Phe Ser Phe Asp Ser Arg Asp Lys Glu Arg Gin Leu Gin 
535 540 545 ' 550 
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aag gag aag aaa ata aaa gaa ctg cag aaa ggg gag gtg ccc aag ttc 2155 
Lys Glu Lys Lys He Lys Glu Leu Gin Lys Gly Glu Val Pro Lys Phe 
555 560 565 

aag gca ctt ccc ttg cct cat ttt gac acc att aac ctg cca gag aag 2203 
Lys Ala Leu Pro Leu Pro His Phe Asp Thr He Asn Leu Pro Glu Lys 
570 575 580 

aag gta aag aat gtg acc cag att gaa cct ttc tgc ttg gag act gac 2251 
Lys Val Lys Asn Val Thr Gin He Glu Pro Phe Cys Leu Glu Thr Asp 
585 590 595 

aga aga ggt get ctg aag gca cag act tgg aag cac cag ctg gaa gaa 2299 
Arg Arg Gly Ala Leu Lys Ala Gin Thr Trp Lys His Gin Leu Glu Glu 
600 605 610 

gaa ctg aga cag cag aaa gaa gca get tgt ttc aag get cgt cca aac 2347 
Glu Leu Arg Gin Gin Lys Glu Ala Ala Cys Phe Lys Ala Arg Pro Asn 
615 620 625 630 

acc gtc ate tct cag gag ccc ttt gtt ccc aag aaa gag aag aaa tea 2395 
Thr Val He Ser Gin Glu Pro Phe Val Pro Lys Lys Glu Lys Lys Ser 
635 640 645 

gtt get gag ggc ctt tct ggt tct eta gtt cag gaa cct ttt cag ctg 2443 
Val Ala Glu Gly Leu Ser Gly Ser Leu Val Gin Glu Pro Phe Gin Leu 
650 655 660 

get act gag aag aga gec aaa gag egg cag gag ctg gag aag aga atg 2491 
Ala Thr Glu Lys Arg Ala Lys Glu Arg Gin Glu Leu Glu Lys Arg Met 
665 670 675 

get gag gta gaa gec cag aaa gec cag cag ttg gag gag gec aga eta 2539 
Ala Glu Val Glu Ala Gin Lys Ala Gin Gin Leu Glu Glu Ala Arg Leu 
680 685 690 

cag gag gaa gag cag aaa aaa gag gag ctg gec agg eta egg aga gaa 2587 
Gin Glu Glu Glu Gin Lys Lys Glu Glu Leu Ala Arg Leu Arg Arg Glu 
695 . 700 705 710 

ctg gtg cat aag gca aat cca ata cgc aag tac cag ggt ctg gag ata 2635 
.Leu Val His Lys Ala Asn Pro He Arg Lys Tyr Gin Gly Leu Glu He 
715 720 725 

aag tea agt gac cag cct ctg act gtg cct gta tct ccc aaa ttc tec 2683 
Lys Ser Ser Asp Gin Pro Leu Thr Val Pro Val Ser Pro Lys Phe Ser 
730 735 740 



act cga ttc cac tgc taaactcagc tgtgagctgc ggataccgcc cggcaatggg 
Thr Arg Phe His Cys 
745 


2738 


acctgctctt aacctcaaac 


etaggacegt 


ettgetttgt 


cattgggcat 


ggagagaacc 


2798 


catttctcca gacttttacc 


tacccgtgcc 


tgagaaagca 


tacttgacaa 


ctgtggactc 


2858 


cagttttgtt gagaattgtt 


ttcttacatt 


actaaggcta 


ataatgagat 


gtaactcatg 


2918 


aatgtctcga ttagactcca 


tgtagttact 


tcctttaaac 


catcagccgg 


ccttttatat 


2978 


gggtcttcac tctgactaga 


atttagtctc 


tgtgtcagca 


cagtgtaatc 


tetattgeta 


3038 


ttgcccctta cgactctcac 


cctctcccca 


ctttttttaa 


aaattttaac 


cagaaaataa 


3098 


agatagttaa atcctaagat 


agagattaag 


tcatggttta 


aatgaggaac 


aatcagtaaa 


3158 


tcagattctg tcctcttctc 


tgcataccgt 


gaatttatag 


ttaaggatcc 


ctttgctgtg 


3218 
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agggtagaaa acctcaccaa ctgcac'cagt gaggaagaag actgcgtgga ttcatgggga 3278 

gcctcacagc agccacgcag caggctctgg gtggggctgc cgttaaggca cgttctttcc 3338 

ttactggtgc tgataacaac agggaaccgt gcagtgtgca ttttaagacc tggcctggaa 3398 

taaatacgtt ttgtctttcc ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3458 



aaaaaaaaaa aa 

<210> 34 

<211> 747 

<212> PRT 

<213> NM_012112 C20orfl, chromosome 20 open reading frame 1 

<400> 34 

Met Ser Gin Val Lys Ser Ser Tyr Ser Tyr Asp Ala Pro Ser Asp Phe 
1 5 10 15 



He Asn Phe Ser Ser Leu Asp Asp Glu Gly Asp Thr Gin Asn He Asp 
20 25 30 



Ser Trp Phe Glu Glu Lys Ala Asn Leu Glu Asn Lys Leu Leu Gly Lys 
35 40 45 



Asn Gly Thr Gly Gly Leu Phe Gin Gly Lys Thr Pro Leu Arg Lys Ala 
50 55 60 



Asn Leu Gin Gin Ala He Val Thr Pro Leu Lys Pro Val Asp Asn Thr 
65 70 75 . 80 



Tyr Tyr Lys Glu Ala Glu Lys Glu Asn Leu Val Glu Gin Ser He Pro 
85 90 95 



Ser Asn Ala Cys Ser Ser Leu Glu Val Glu Ala Ala He Ser Arg Lys 
100 105 : " HO 



Thr Pro Ala Gin Pro Gin Arg Arg Ser Leu Arg Leu Ser Ala Gin Lys 
115 120 125 



Asp Leu Glu Gin Lys Glu Lys His His Val Lys Met Lys Ala Lys Arg 
130 135 140 



Cys Ala Thr Pro Val He He Asp Glu He Leu Pro Ser Lys Lys Met 
145 150 155 160 



3470 



Lys Val Ser Asn Asn Lys Lys Lys Pro Glu Glu Glu Gly Ser Ala His 
165 170 175 
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Gin Asp Thr Ala Glu Ly3 Asn Ala Ser Ser Pro Glu Lys Ala Lys Gly 
180 185 190 



Arg His Thr Val Pro Cys Met Pro Pro Ala Lys Gin Lys Phe Leu Lys 
195 ,200 " 205 



Ser Thr Glu Glu Gin Glu Leu Glu Lys Ser Met Lys Met Gin Gin Glu 
210 215 220 



Val Val Glu Met Arg Lys Lys Asn Glu Glu Phe Lys Lys Leu Ala Leu 
225 230 235 240 



Ala Gly He Gly Gin Pro Val Lys Lys Ser Val Ser Gin Val Thr Lys 
245 250 255 



Ser Val Asp Phe His Phe Arg Thr Asp Glu Arg He Lys Gin His Pro 
260 265 270 



Lys Asn Gin Glu Glu Tyr Lys Glu Val Asn Phe Thr Ser Glu Leu Arg 
275 280 285 



Lys His Pro Ser Ser Pro Ala Arg Val Thr Lys Gly Cys Thr He Val 
290 295 300 



Lys Pro Phe Asn Leu Ser Gin Gly Lys Lys Arg Thr Phe Asp Glu Thr 
305 310 315 320 



Val Ser Thr Tyr Val Pro Leu Ala Gin Gin Val Glu Asp Phe His Lys 
325 330 335 



Arg Thr Pro Asn Arg Tyr His Leu Arg Ser Lys Lys Asp Asp He Asn 
340 345 350 



Leu Leu Pro Ser Lys Ser Ser Val Thr Lys He Cys Arg Asp Pro Gin 
355 360 365 



Thr Pro Val Leu Gin Thr Lys His Arg Ala Arg Ala Val Thr Cys Lys 
370 375 380 



Ser Thr Ala Glu Leu Glu Ala Glu Glu Leu Glu Lys Leu Gin Gin Tyr 
385 390 395 400 



Lys Phe Lys Ala Arg Glu Leu Asp Pro Arg He Leu Glu Gly Gly Pro 
405 410 415 



He Leu Pro Lys Lys Pro Pro Val Lys Pro Pro Thr Glu Pro He Gly 
420 425 430 



Phe Asp Leu Glu He Glu Lys Arg He Gin Glu Arg Glu Ser Lys Lys 
435 440 445 
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Lys Thr Glu Asp Glu His Phe Glu Phe His Ser Arg Pro Cys Pro Thr 
450 455 460 



Lys lie Leu Glu Asp Val Val Gly Val Pro Glu Lys Lys Val Leu Pro 
465 470 475 480 



He Thr Val Pro Lys Ser Pro Ala Phe Ala Leu Lys Asn Arg He Arg 
485 490 495 



Met Pro Thr Lys Glu Asp Glu Glu Glu Asp Glu Pro Val Val He Lys 
500 505 510 



Ala Gin Pro Val Pro His Tyr Gly Val Pro Phe Lys Pro Gin He Pro 
515 520 525 



Glu Ala Arg Thr Val Glu lie Cys Pro Phe Ser Phe Asp Ser Arg Asp 
530 535 54 0 



Lys Glu Arg Gin Leu Gin Lys Glu Lys Lys He Lys Glu Leu Gin Lys 
545 550 555 560 



Gly Glu Val Pro Lys Phe Lys Ala Leu Pro Leu Pro His Phe Asp Thr 
565 570 575 



He Asn Leu Pro Glu Lys Lys Val Lys Asn Val Thr Gin He Glu Pro 
580 585 590 



Phe Cys Leu Glu Thr Asp Arg Arg Gly Ala Leu Lys Ala Gin Thr Trp 
595 600 605 



Lys His Gin Leu Glu Glu Glu Leu Arg Gin Gin Lys Glu Ala Ala Cys 
610 615 620 



Phe Lys Ala Arg Pro Asn Thr Val He Ser Gin Glu Pro Phe Val Pro 
625 630 635 640 



Lys Lys Glu Lys Lys Ser Val Ala Glu Gly Leu Ser Gly Ser Leu Val 
645 650 655 



Gin Glu Pro Phe Gin Leu Ala Thr Glu Lys Arg Ala Lys Glu Arg Gin 
660 665 670 



Glu Leu Glu Lys Arg Met Ala Glu Val Glu Ala Gin Lys Ala Gin Gin 
675 680 685 



Leu Glu Glu Ala Arg Leu Gin Glu Glu Glu Gin Lys Lys Glu Glu Leu 
690 695 700 



Ala Arg Leu Arg Arg Glu Leu Val His Lys Ala Asn Pro He Arg Lys 
705 710 715 720 
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Tyr Gin Gly Leu Glu lie Lys Ser Ser Asp Gin Pro Leu Thr Val Pro 
725 730 735 



Val Ser Pro Lys Phe Ser Thr Arg Plie His Cys 
740 745 

<210> 35 

<211> 1246 

<212> DNA 

<213> NM 001769 CD9 



<220> 

<221> CDS 

<222> (112) . . (795) 

<223> 



<400> 35 

gaccagccta cagccgcctg catctgtatc cagcgccagg tcccgccagt cccagctgcg 60 

cgcgcccccc agtcccgcac ccgttcggcc caggctaagt tagccctcac c atg ccg 117 

Met Pro 
1 

gtc aaa gga ggc acc aag tgc ate aaa tac ctg ctg ttc gga ttt aac 165 
Val Lys Gly Gly Thr Lys Cys lie Lys Tyr Leu Leu Phe Gly Phe Asn 
5 10 15 

ttc ate ttc tgg ctt gee ggg att get gtc ctt gec att gga eta tgg 213 
Phe lie Phe Trp Leu Ala Gly lie Ala Val Leu Ala lie Gly Leu Trp 
20 . 25 30 

etc cga ttc gac tct cag acc aag age ate ttc gag caa gaa act aat 2 61 

Leu Arg Phe Asp Ser Gin Thr Lys Ser lie Phe Glu Gin Glu Thr Asn 
35 40 45 50 

aat aat aat tec age ttc tac aca gga gtc tat att ctg ate gga gee 309 
Asn Asn Asn Ser Ser Phe Tyr Thr Gly Val Tyr lie Leu lie Gly Ala 
55 60 65 

ggc gee etc atg atg ctg gtg ggc ttc ctg ggc tgc tgc ggg get gtg 357 
Gly Ala Leu Met Met Leu Val Gly Phe Leu Gly Cys Cys Gly Ala Val 
70 75 80 

cag gag tec cag tgc atg ctg gga ctg ttc ttc ggc ttc etc ttg gtg 405 
Gin Glu Ser Gin Cys Met Leu Gly Leu Phe Phe Gly Phe Leu Leu Val 
85 90 95 

ata ttc gee att gaa ata get gcg gee ate tgg gga tat tec cac aag 453 
lie Phe Ala He Glu He Ala Ala Ala He Trp Gly Tyr Ser His Lys 
100 105 110 

gat gag gtg att aag gaa gtc cag gag ttt tac aag gac acc tac aac 501 
Asp Glu Val He Lys Glu Val Gin Glu Phe Tyr Lys Asp Thr Tyr Asn 
115 120 . 125 130 

aag ctg aaa acc aag gat gag ccc cag egg gaa acg ctg aaa gee ate 549 
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Lys Leu Lys Thr Lys Asp Glu Pro Gin Arg Glu Thr Leu Lys Ala lie 
135 140 145 

cac tat gcg ttg aac tgc tgt ggt ttg get ggg ggc gtg gaa.cag ttt 597 
His Tyr Ala Leu Asn Cys Cys Gly Leu Ala Gly Gly Val Glu Gin Phe 
150 155 160 

ate tea gac ate tgc ccc aag aag gac gta etc gaa ace ttc ace gtg 645 
He Ser Asp He Cys Pro Lys Lys Asp Val Leu Glu Thr Phe Thr Val 
165 170 175 

aag tec tgt cct gat gec ate aaa gag gtc ttc gac aat aaa ttc cac 693 
Lys Ser Cys Pro Asp Ala He Lys Glu Val Phe Asp Asn Lys Phe His 
180 185 190 

ate ate ggc gca gtg ggc ate ggc att gee gtg gtc atg ata ttt ggc 741 
He lie Gly Ala Val Gly He Gly He Ala Val Val Met He Phe Gly 
195 200 205 210 

atg ate ttc agt atg ate ttg tgc tgt get ate cgc agg aac cgc gag 789 
Met He Phe Ser Met He Leu Cys Cys Ala He Arg Arg Asn Arg Glu 
215 220 225 

atg gtc tagagtcagc ttacatccct gagcaggaaa gtttacccat gaagattggt 845 
Met Val 



gggatttttt gtttgtttgt tttgttttgt ttgttgtttg ttgtttgttt ttttgecact 905 

aattttagta ttcattctgc attgetagat aaaagctgaa gttactttat gtttgtcttt 965 

taatgettea ttcaatattg acatttgtag ttgagcgggg ggtttggttt gctttggttt 1025 

atattttttc agttgtttgt ttttgcttgt tatattaagc agaaatcctg caatgaaagg 1085 

tactatattt gctagactct agacaagata ttgtacataa aagaattttt ttgtctttaa 1145 

atagatacaa atgtctatca actttaatca agttgtaact tatattgaag acaatttgat 1205 

acataataaa aaattatgac aatgtcaaaa aaaaaaaaaa a 124 6 

<210> 36 

<211> 228 

<212> PRT 

<213> NM_001769 CD9 

<400> 36 

Met Pro Val Lys Gly Gly Thr Lys Cys He Lys Tyr Leu Leu Phe Gly 
1 5 10 15 



Phe Asn Phe He Phe Trp Leu Ala Gly He Ala Val Leu Ala He Gly 
20 25 30 



Leu Trp Leu Arg Phe Asp Ser Gin Thr Lys Ser He Phe Glu Gin Glu 
35 40 45 



Thr Asn Asn Asn Asn Ser Ser Phe Tyr Thr Gly Val Tyr He Leu He 
50 55 60 
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Gly Ala Gly Ala Leu Met Met Leu Val Gly Phe Leu Gly Cys Cys Gly 
65 70 75 80 



Ala Val Gin Glu Ser Gin Cys Met Leu Gly Leu Phe Phe- Gly Phe Leu 
85 90 95 



Leu Val He Phe Ala He Glu He Ala Ala Ala He Trp Gly Tyr Ser 
100 105 110 



His Lys Asp Glu Val He Lys Glu Val Gin Glu Phe Tyr Lys Asp Thr 
115 120 125 



Tyr Asn Lys Leu Lys Thr Lys Asp Glu Pro Gin Arg Glu Thr Leu Lys 
130 135 . 140 



Ala He His Tyr Ala Leu Asn Cys Cys Gly Leu Ala Gly Gly Val Glu 
145 150 155 160 



Gin Phe He Ser Asp He Cys Pro Lys Lys Asp Val Leu Glu Thr Phe 
165 170 175 



Thr Val Lys Ser Cys Pro Asp Ala He Lys Glu Val Phe Asp Asn Lys 
180 185 190 



Phe His He He Gly Ala Val Gly He Gly He Ala Val Val Met He 
195 200 205 



Phe Gly Met He Phe Ser Met He Leu Cys Cys Ala He Arg Arg Asn 
210 215 220 



Arg Glu Met Val 
225 



<210> 37 

<211> 7557 

<212> DNA 

<213> NM_020859 ShrmL, Shroom-related protein 



<220> 

<221> CDS 

<222> (958) (6942) 

<223> 



<400> 37 

ggttattaac tgtaagactg tttgaattgt caggtctttg tggtgagctg aattctgcat 60 
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ggcagtgctt actgtgcaga aaggttgcaa tgcctcactt tgccttaatc catgttcctc 120 

taacttcctt actctttctc tcaagaggag gcaagtggct gtggcggccg cagcagtggc 180 

tgatcatcac tgaaaatacc aaagaaaaga actgagctgc ctccttcata ttttttccat 240 

tgaggattaa tttaccgtgc tttttcattt tctctacatc ctgcaaaagt ttttttctct 300 

cctaagaaac aaactatgaa ctgattgttg aaaaaaagaa gtaaaaagtt ttagcacagc 360 

ttctctgtct cttcgggaca agttagaaaa ttctgaagtg agccgaagca tagtaagtgc 420 

tttctttctt tttaagctac ttctggggag ggaggaggct attgtaatgg taaatttcac 480 

tccgagagga agaaagggtt gataatcaat caaaaatgag gtattccttt gagtatttgt 540 

gattttctta ctatattgta agatgctttt aattttctct gtaaaatagg cagaaatggt 600 

tttagtgtgt gtatgtgtga aataaaagct cagaaaagca atcttcagag cgccactgaa 660 

ggaagttttg acgaacggag tagagatgta taccacttgg gggcttcagt gagaacccag 720 

aattcctgga ggaggattta cattcagaaa tgttgaagtg aaaattcctt ctggttcagc 780 

atcttggagt tcagcttgga agaacatttt acgtatggaa gaatttgctt ctccaaacct 840 

ctcttttggc cattgtgtgt cctgaaggat gggacaactt gtgctgtaga agcactgctt 900 

gcctgagttt gcttcaggca ttttaaattt aacttgaggg atcatgtgtt tggcatg 957 

atg agg acc act gaa gac ttc cac aag cct agt gcc aca tta aac tct 1005 
Met Arg Thr Thr Glu Asp Phe His Lys Pro Ser Ala Thr Leu Asn Ser 
15 10 15 

aac acg gcc acc aag gga agg tac att tat ctg gag gca ttc ctg gag 1053 
Asn Thr Ala Thr Lys Gly Arg Tyr He Tyr Leu Glu Ala Phe Leu Glu 
20 25 30 

gga gga get ccc tgg ggt ttt act eta aag ggt ggc ctg gag cac gga 1101 
Gly Gly Ala Pro Trp Gly Phe Thr. Leu Lys Gly Gly Leu Glu His Gly 
35 40 45 

gaa cca tta ate ate tct aag gtc gaa gaa ggg ggc aaa gca gac acc 1149 
Glu Pro Leu He He Ser Lys Val Glu Glu Gly Gly Lys Ala Asp Thr 
50 55 60 

ctg age tec aaa ctg cag get ggg gat gag gtt gtg cac ate aat gag 1197 
Leu Ser Ser Lys Leu Gin Ala Gly Asp Glu Val Val. His He Asn Glu 
"65 70 75 80 

gtg act ctg age age tec aga aag gag gca gtt tec ctg gtg aaa gga 1245 
Val Thr Leu Ser Ser Ser Arg Lys Glu Ala Val Ser Leu Val Lys Gly 
85 90 95 

tec tac aag acc etc agg ctg gta gtg cgc aga gat gtg tgc aca gac 1293 
Ser Tyr Lys Thr Leu Arg Leu Val Val Arg Arg Asp Val Cys Thr Asp 
100 105 110 

cca ggc cat gca gat act ggt gcc tct aac ttc gtc age cca gaa cac 1341 
Pro Gly His Ala Asp Thr Gly Ala Ser Asn Phe Val Ser Pro Glu His 
115 120 125 

etc acc tct ggc ccc cag cac agg aaa gca gcg tgg tea gga ggg gtt 1389 
Leu Thr Ser Gly Pro Gin His Arg Lys Ala Ala Trp Ser Gly Gly Val 
130 135 140 

aaa ctt egg ctg aag cac agg tct agt gag cct gca ggc cga cct cac 1437 
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Lys Leu Arg Leu Lys His Arg Ser Ser Glu Pro Ala Gly Arg Pro His 
145 150 . 155 160 

teg tgg cac aca act aaa tct ggg gag aag caa ccc gat gec age atg 1485 
Ser Trp His Thr Thr Lys Ser Gly Glu Lys Gin Pro Asp Ala Ser Met 
165 170 175 

atg cag ata tct cag ggt atg ate ggc cct cct tgg cac caa age tac 1533 
Met Gin He Ser Gin Gly Met He Gly Pro Pro Trp His Gin Ser Tyr 
180 185 190 

cat tec age tec tct act agt gac etc tec aac tat gac cat get tat 1581 
His Ser Ser Ser Ser Thr Ser Asp Leu Ser Asn Tyr Asp His Ala Tyr 
195 200 205 

eta agg egg age cct gac cag tgc age tec cag ggg age atg gag age 1629 
Leu Arg Arg Ser Pro Asp Gin Cys Ser Ser Gin Gly Ser Met Glu Ser 
210 215 220 

ctg gag ccc agt ggg gca tac cca ccc tgt cat ctt tec cct gee aag 1677 
Leu Glu Pro Ser Gly Ala Tyr Pro Pro Cys His Leu Ser Pro Ala Lys 
225 230 235 240 

tec ace ggc age att gac cag etc age cac ttc cat aac aag aga gac 1725 
Ser Thr Gly Ser He Asp Gin Leu Ser His Phe His Asn Lys Arg Asp 
245 '250 255 

teg get tac age tct ttc tec ace agt tct age ate eta gag tat cca 1773 
Ser Ala Tyr Ser Ser Phe Ser Thr Ser Ser Ser He Leu Glu Tyr Pro 
260 265 270 

cac cct ggc ate tct gee egg gag cgt tea ggc tec atg gac aat act 1821 
His Pro Gly He Ser Ala Arg Glu Arg Ser Gly Ser Met Asp Asn Thr 
275 280 285 

tct get cga ggt ggc etc etc gaa ggg atg agg cag gca gat att cgc 1869 
Ser Ala Arg Gly Gly Leu Leu Glu Gly Met Arg Gin Ala Asp He Arg 
290 295 300 

tat gtc aag aca gtc tat gac ace egg agg gga gtc tea gca gag tat 1917 
Tyr Val Lys Thr Val Tyr Asp Thr Arg Arg Gly Val Ser Ala Glu Tyr 
305 310 315 320 

gag gtg aac tct tea gee ctg ctg ctt caa ggt agg gag gee cga gee 1965 
Glu Val Asn Ser Ser Ala Leu Leu Leu Gin Gly Arg Glu Ala Arg Ala 
325 330 335 

tea gca aat ggt cag ggc tat gat aaa tgg tct aat att cct egg ggc 2013 
Ser Ala Asn Gly Gin Gly Tyr Asp Lys Trp Ser Asn lie Pro Arg Gly 
340 345 350 

aag gga gtg cca ccc cca tec tgg age cag cag tgc ccc agt tec ttg 2061 
Lys Gly Val Pro Pro Pro Ser Trp Ser Gin Gin Cys Pro Ser Ser Leu 
355 360 365 

gag act gee acg gac aac ctt cct cct aag gtg ggt gca ccc ctg cct 2109 
Glu Thr Ala Thr Asp Asn Leu Pro Pro Lys Val Gly Ala Pro Leu Pro 
370 375 380 

cca get egg agt gac agt tac gca gca ttt egg cac cgt gag egg ccc 2157 
Pro Ala Arg Ser Asp Ser Tyr Ala Ala Phe Arg His Arg Glu Arg Pro 
385 390 395 400 

age tec tgg tct age ctt gat cag aaa egg etc tgc egg cct cag gca 2205 
Ser Ser Trp Ser Ser Leu Asp Gin Lys Arg Leu Cys Arg Pro Gin Ala 
405 410 415 

aac tct tta ggc tec ctg aag tct cca ttc ata gag gag cag ctg cat 2253 
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Asn Ser Leu Gly Ser Leu Lys Ser Pro Phe He Glu Glu Gin Leu His 
420 425 430 

act gtg ctg gag aag agt cca gag aac age ccc cca gtg aag ccc aag 2301 
Thr Val Leu Glu Lys Ser Pro Glu Asn Ser Pro Pro Val Lys Pro Lys 
435 .440 445 

cat aac tat acc cag aag gec caa cct ggc caa cct ctg ctg ccg acc 2349 
His Asn Tyr Thr Gin Lys Ala Gin Pro Gly Gin Pro Leu Leu Pro Thr 
450 455 460 

age ate tac gcg gta cct tec ctg gag cca cac ttt gec cag gtg cct 2397 
Ser He Tyr Ala Val Pro Ser Leu Glu Pro His Phe Ala Gin Val Pro 
465 470 475 480 

cag cct tct gtg agt age aac ggt atg etc tac cct gca ctg gee aag 2445 
Gin Pro Ser Val Ser Ser Asn Gly Met Leu Tyr Pro Ala Leu Ala Lys 
485 490 495 

gag agt gga tac ata gee cct cag gga gca -tgc aac aag atg get acc 2493 
Glu Ser Gly Tyr He Ala Pro Gin Gly Ala Cys Asn Lys Met Ala Thr 
500 505 . 510 

att gat gag aat ggg aac cag aat gga tct ggc agg cct ggg ttt gee 2541 
He Asp Glu Asn Gly Asn Gin Asn Gly Ser Gly Arg Pro Gly Phe Ala 
515 520 525 

ttc tgc cag ccc tta gaa cat gac ttg ctg tec cca gtg gag aag aaa 2589 
Phe Cys Gin Pro Leu Glu His Asp Leu Leu Ser Pro Val Glu Lys Lys 
530 535 540 

cca gaa get aca gee aag tat gtc ccc tec aaa gtc cat ttc tgt tea 2637 
Pro Glu Ala Thr Ala Lys Tyr Val Pro Ser Lys Val His Phe Cys Ser 
545 550 555 560 

gtg cct gaa aat gag gag gat gee tec ctg aag aga cat etc aca cct 2685 
Val Pro Glu Asn Glu Glu Asp Ala Ser Leu Lys Arg His Leu Thr Pro 
565 570 575 

ccc caa ggc aac age cca cat tec aat gag aga aag age acc cac agt 2733 
Pro Gin Gly Asn Ser Pro His Ser Asn Glu Arg Lys Ser Thr His Ser 
580 585 590 

aac aaa cca tct tct cat ccc cac age etc aaa tgc cct cag get cag 2781 
Asn Lys Pro Ser Ser His Pro His Ser Leu Lys Cys Pro Gin Ala Gin 
595 600 605 

gee tgg caa gcg ggt gaa gac aag aga tct tec agg etc tea gag ccc 2829 
Ala Trp Gin Ala Gly Glu Asp Lys Arg Ser Ser Arg Leu Ser Glu Pro 
610 615 620 

tgg gag ggc gat ttc cag gaa gac cac aat gee aac etc tgg agg agg 2877 
Trp Glu Gly Asp Phe Gin Glu Asp His Asn Ala Asn Leu Trp Arg Arg 
625 630 635 * 640 

ctg gag aga gaa ggc eta ggc cag age ctg tea ggc aac ttt ggc aag 2925 
Leu Glu Arg Glu Gly Leu Gly Gin Ser Leu Ser Gly Asn Phe Gly Lys 
645 650 ~ 655 

acc aag tea gec ttc tea tct etc cag aac att cct gag agt ctg aga 2973 
Thr Lys Ser Ala Phe Ser Ser Leu Gin Asn He Pro Glu Ser Leu Arg 
660 665 670 

aga cac age age ctg gag eta ggc egg gga acc cag gag ggt tac ccc 3021 
Arg His Ser Ser Leu Glu Leu Gly Arg Gly Thr Gin Glu Gly Tyr Pro 
675 680 685 

ggg ggc agg ccc acc tgt gca gtc aac acc aag gca gaa gac cct ggg 3069 
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Gly Gly Arg Pro Thr Cys Ala Val Asn Thr Lys Ala Glu Asp Pro Gly 
690 695 700 

agg aaa gcc get cct gac etc ggg age cat ctg gac egg cag gtt tec 3117 
Arg Lys Ala Ala Pro Asp Leu Gly Ser His Leu Asp Arg Gin Val Ser 
705 710 715 720 

tac ccg egg ccc gag ggg agg ace ggt gcc teg get tct ttc aac age 3165 
Tyr Pro Arg Pro Glu Gly Arg Thr Gly Ala Ser Ala Ser Phe Asn Ser 
725 730 735 

aca gac cca agt ccc gaa gag ccg cct gcc ccc teg cac ccg cac aca 3213 
Thr Asp Pro Ser Pro Glu Glu Pro Pro Ala Pro Ser His Pro His Thr 
740 745 750 

tec agt ctg ggc egg agg ggg ccc ggc cca ggc age gcc teg get ctt 3261 
Ser Ser Leu Gly Arg Arg Gly Pro Gly Pro Gly Ser Ala Ser Ala Leu 
755 760 765 

cag ggc ttt cag tac ggg aag ccc cac tgc teg gtg ctg gag aag gtc 3309 
Gin Gly Phe Gin Tyr Gly Lys Pro His Cys Ser Val Leu Glu Lys Val 
770 775 780 

tec aaa ttc gag cag cga gag caa ggg age cag aga ccg agt gtg ggc 3357 
Ser Lys Phe Glu Gin Arg Glu Gin Gly Ser Gin Arg Pro Ser Val Gly 
785 790 795 800 

ggc tct ggt ttt ggc cat aac tat agg ccc cac agg ace gtc tea act 3405 
Gly Ser Gly Phe Gly His Asn Tyr Arg Pro His Arg Thr Val Ser Thr 
805 810 815 

tec agt act tct ggg aat gac ttc gag gag aca aaa gca cac att cgt 3453 
Ser Ser Thr Ser Gly Asn Asp Phe Glu Glu Thr Lys Ala His lie Arg 
820 ' 825 830 

ttc tct gag tea get gaa ccc eta ggc aac ggg gag cag cac ttc aaa 3501 
Phe Ser Glu Ser Ala Glu Pro Leu Gly Asn Gly Glu Gin His Phe Lys 
835 840 845 

aac ggg gag ctg aag ttg gaa gag get tec egg cag ccc tgc ggt cag 3549 
Asn Gly Glu Leu Lys Leu Glu Glu Ala Ser Arg Gin Pro Cys Gly Gin 
850 855 860 

cag ctg age gga gga gcg teg gac age ggc cgt ggc ccc cag agg ccg 3597 
Gin Leu Ser Gly Gly Ala Ser Asp Ser Gly Arg Gly Pro Gin Arg Pro 
865 870 875 880 

gac get egg etc etc cgt age cag age ace ttc cag etc tec age gag 3645 
Asp Ala Arg Leu Leu Arg Ser Gin Ser Thr Phe Gin . Leu Ser Ser Glu 
885 '890 895 

cca gag agg gag ccc gag tgg egg gac agg ccc ggc teg ccc gaa teg 3693 
Pro Glu Arg Glu Pro Glu Trp Arg Asp Arg Pro Gly Ser Pro Glu Ser 
900 905 910 

ccc ctg ctg gat gcc ccc ttc age cgc gcc tac egg aac age ate aag 3741 
Pro Leu Leu Asp Ala Pro Phe Ser Arg Ala Tyr Arg Asn Ser lie Lys 
915 920 925 

gac gca cag tec cgt gtc ttg ggg gcc ace tec ttt cga cgt cga gac 3789 
Asp Ala Gin Ser Arg Val Leu Gly Ala Thr Ser Phe Arg Arg Arg Asp 
930 935 940 

ctg gag ctg ggg gcg ccc gtg gcg teg agg tec tgg egg cca egg cct 3837 
Leu Glu Leu Gly Ala Pro Val Ala Ser Arg Ser Trp Arg Pro Arg Pro 
945 950 955 960 

tec teg gcc cac gtg ggg ctg egg age ccc gag gcg teg gcc tec gcc 3885 
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Ser Ser Ala His Val Gly Leu Arg Ser Pro Glu Ala Ser Ala Ser Ala 
965 970 . 975 



tec ccg cac acg ccc egg gag egg cac age gtg ace cct get gag ggc 
Ser Pro His Thr Pro Arg Glu Arg His Ser Val Thr Pro Ala Glu Gly 
980 985 990 



3933 



gac ctg gee agg ccc gtg ccc cct gec gee egg aga ggt get cgc egg 
Asp Leu Ala Arg Pro Val Pro Pro Ala Ala Arg Arg Gly Ala Arg Arg 
995 1000 1005 



3981 



cgc ctg act ccc gag cag aag aag cgc tec tac teg gag ccc gag 
Arg Leu ' Thr Pro Glu Gin Lys Lys Arg Ser Tyr Ser Glu Pro Glu 
1010 1015 " 1020 



4026 



aag atg aac gag gtg ggg ate gtg gag gag gec gaa ccg gca ccc 
Lys Met Asn Glu Val Gly lie Val Glu Glu Ala Glu Pro Ala Pro 
1025 1030 1035 



4071 



ctg ggc ccg cag aga aat ggg atg cgt ttc ccg gag age age gtg 
Leu Gly Pro Gin Arg Asn Gly Met Arg Phe Pro Glu Ser Ser Val 
1040 1045 1050 



4116 



gec gac egg cgc cgt etc , ttc gag cgc gat ggc aag gec tgc tec 
Ala Asp Arg Arg Arg Leu Phe Glu Arg Asp Gly Lys Ala Cys Ser 
1055 1060 ~ 1065 



4161 



acg etc age ctg teg ggg ccc 
Thr Leu Ser Leu Ser Gly Pro 
1070 1075 



gag ctg aag cag ttc 
Glu Leu Lys Gin Phe 
1080 



cag cag age 
Gin Gin Ser 



4206 



gee ctg gcg gac tac ate cag cgc aag ace ggc aag egg cct ace 
Ala Leu Ala Asp Tyr lie Gin Arg Lys Thr Gly Lys Arg Pro Thr 
1085 1090 1095 



4251 



tec gee gec ggc tgc age etc cag gag ccc ggg cca ctg cgt gag 
Ser Ala Ala Gly Cys Ser Leu Gin Glu Pro Gly Pro Leu Arg Glu 
1100 1105 1110 



4296 



cgc gec cag agt gee tac etc cag ccc ggc ccc gcg gcg etc gaa 
Arg Ala Gin Ser Ala Tyr Leu Gin Pro Gly Pro Ala Ala Leu Glu 
1115 1120 1125 



4341 



ggc tec ggc etc gee teg gee 
Gly Ser Gly Leu Ala Ser Ala 
1130 1135 



tec age ttg age tea 
Ser Ser Leu Ser Ser 
1140 



ctg egg gag 
Leu Arg Glu 



4386 



ccc age ctg cag ccc cgc agg 
Pro Ser Leu Gin Pro Arg Arg 
1145 1150 



gag gee acg etc ctg 
Glu Ala Thr Leu Leu 
1155 



ccg gee aca 
Pro Ala Thr 



4431 



gtt gca gaa ace cag cag get ccc cga gat cgc age age tec ttc 
Val Ala Glu Thr' Gin Gin Ala Pro Arg Asp Arg Ser Ser Ser Phe 
1160 1165 1170 



4476 



gee ggt ggc cgc cgc etc ggg 
Ala Gly Gly Arg Arg Leu Gly 
1175 1180 



gaa egg cga cgc ggg 
Glu Arg Arg Arg Gly 
1185 



gac ctg ctt 
Asp Leu Leu 



4521 



age gga gca aac ggt gga aca 
Ser Gly Ala Asn Gly Gly Thr 
1190 1195 



agg ggc ace cag aga 
Arg Gly Thr Gin Arg 
1200 



ggg gat gag 
Gly Asp Glu 



4566 



ace ccc agg gag cca tec tec tgg ggg gee agg gec ggg aag tec 
Thr Pro Arg Glu Pro Ser Ser Trp Gly Ala Arg Ala Gly Lys Ser 
1205 1210 1215 



4611 



atg teg gec gag gac ctg ctg gaa cgc teg gac gtc ctt gcg ggc 



4656 
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Met Ser Ala Glu Asp Leu Leu Glu Arg Ser Asp Val Leu Ala Gly 
1220 1225. 1230 



cct gtc cat gtg agg. tec agg tea tct ccc gec acc 
Pro Val His Val Arg Ser Arg Ser Ser Pro Ala Thr 
1235 1240 1245 



cgc cag gat gtg ctt ttg ggg 
Arg Gin Asp Val Leu Leu Gly 
1250 1255 



caa gac agt ggc ttt 
Gin Asp Ser Gly Phe 
1260 



gca gac aag 
Ala Asp Lys 



ggt ctt gtg 
Gly Leu Val 



4701 



4746 



aag gat cca tgt tat. ttg get ggt cct gga tct agg tea etc agt 
Lys Asp Pro Cys Tyr Leu Ala Gly Pro Gly Ser Arg Ser Leu Ser 
1265 1270 1275 



4791 



tgt tea gaa aga ggc caa gaa gag atg ctg ctg etc ttc cac cat 
Cys Ser Glu Arg Gly Gin Glu Glu Met Leu Leu Leu Phe His His 
1280 1285 1290 



4836 



etc acc cct cgt tgg ggt ggt tea ggc tgc aaa gee 
Leu Thr Pro Arg Trp Gly Gly Ser Gly Cys Lys Ala 
1295 1300 " 1305 



att ggt gat 
lie Gly Asp 



4881 



tec tec gtt cct agt gaa tgt 
Ser Ser Val Pro Ser Glu Cys 
1310 1315 



cct gga acc ctg gac cat cag agg 
Pro Gly Thr Leu Asp His Gin Arg 
1320 



4926 



caa gec ' agt agg aca ccc tgc ccc agg cca cca ctg 
Gin Ala Ser Arg- Thr Pro Cys Pro Arg. Pro Pro Leu 
1325 1330 1335 



gca gga acg 
Ala Gly Thr 



4971 



caa ggg ctg gtc aca gac acc agg get gca ccc ctg acc cca att 
Gin Gly Leu Val Thr Asp Thr Arg Ala Ala Pro Leu Thr Pro He 
1340 1345 1350 



5016 



ggc acc cct ctg cct tea gec 
Gly Thr Pro Leu Pro Ser Ala 
1355 1360 



att ccc tct ggc tac tgc tea cag 
He Pro Ser Gly Tyr Cys Ser Gin 
1365 



5061 



gac ggt cag aca ggg cga cag 
Asp Gly Gin Thr Gly Arg Gin 
1370 1375 



cct etc ccg ccc tac acc cct gec 
Pro Leu Pro Pro Tyr Thr Pro Ala 
1380 



5106 



atg atg cac aga age aat ggt cac acc ctg acc cag cct ccc ggt 
Met Met His Arg Ser Asn Gly His Thr Leu Thr Gin Pro Pro Gly 
1385 1390 1395 



5151 



cca aga ggc tgt gag ggc gat ggc cca gag cat ggg gta gaa gag 
Pro Arg Gly Cys Glu Gly Asp Gly Pro Glu His Gly Val Glu Glu 
1400 1405 1410 



5196 



gga acg agg aag agg gtc teg ctg cct cag tgg cca cct cct tct 
Gly Thr Arg Lys Arg Val Ser Leu Pro Gin Trp Pro Pro Pro Ser 
1415 1420 1425 



5241 



cga gca aag tgg gec cac gca 
Arg Ala Lys Trp Ala His Ala 
1430 1435 



gee aga gag gac age ctt cct gag 
Ala Arg Glu Asp Ser Leu Pro Glu 
1440 



5286 



gaa tec tea gec cct gat ttt gca aac ctg aag cac tat caa aaa 
Glu Ser Ser Ala Pro Asp Phe Ala Asn Leu Lys His Tyr Gin Lys 
1445 1450 1455 



5331 



cag cag agt ctt cca agt tta tgc age act tct gac cca gac aca 
Gin Gin Ser Leu Pro Ser Leu Cys Ser Thr Ser Asp Pro Asp Thr 
1460 1465 1470 



5376 



cct ctt ggg gec ccg age act cca ggg agg ate tec etc cga ata 



5421 
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Pro Leu Gly Ala Pro Ser Thr Pro Gly Arg lie Ser Leu Arg He 
1475 1480 1485 

tct gag tct gtc ctg egg gac tec ccg cca cct cat gag gat tat 
Ser Glu Ser Val Leu Arg Asp Ser Pro Pro Pro His Glu Asp Tyr 
1490 1495 1500 

gaa gac gaa gtg ttt gtg agg gat ccg cac ccc aag gec acg tec 
Glu Asp Glu Val Phe Val Arg Asp Pro His Pro Lys Ala Thr Ser 
1505 1510 1515 

age ccc aca ttt gaa cct ctt ccc cca ccc cca cct cct cca ccg 
Ser Pro Thr Phe Glu Pro Leu Pro Pro Pro Pro Pro Pro Pro Pro 
1520 1525 1530 

agt cag gaa acc ccg gtg tat age atg gat gac ttc cct cca cct 
Ser Gin Glu Thr Pro Val Tyr Ser Met Asp Asp Phe Pro Pro Pro 
1535 1540 1545 

cct ccc cac act gta tgt gag gcg cag ctg gac agt gag gat ccc 
Pro Pro His Thr Val Cys Glu Ala Gin Leu Asp Ser Glu Asp Pro 
1550 1555 1560 

gag ggg cca cgc ccc age ttc aac aaa ctt tct aaa gtg aca att 
Glu Gly Pro Arg Pro Ser Phe Asn Lys Leu Ser Lys Val Thr He 
1565 1570 1575 

gca agg gaa agg cac atg cct ggt gca gee cat gtg gta ggt agt 
Ala Arg Glu Arg His Met Pro Gly Ala Ala His Val Val Gly Ser 
1580 1585 1590 

cag aca ctg get tec aga etc caa act tct ate aag ggt tea gag 
Gin Thr Leu Ala Ser Arg Leu Gin Thr Ser He Lys Gly Ser Glu 
1595 1600 1605 

get gag tec aca cca ccc tec ttc atg age gtt cac gee caa ctt 
Ala Glu Ser Thr Pro Pro Ser Phe Met Ser Val His Ala Gin Leu 
1610 1615 1620 



get ggg tct ctt ggt ggg cag cca gca* ccc ate cag act caa age 
Ala Gly Ser Leu Gly Gly Gin Pro Ala Pro He Gin Thr Gin Ser 
1625 1630 1635 

etc age cat gat cca gtc agt gga act cag ggt tta gaa aag aaa 
Leu Ser His Asp Pro Val Ser Gly Thr Gin Gly Leu Glu Lys Lys 
1640 1645 1650 

gtc agt cct gat cct cag aag agt tea gaa gac ate aga aca gag 
Val Ser Pro Asp Pro Gin Lys Ser Ser Glu Asp lie Arg Thr Glu 
1655 1660 1665 

get ttg gee aag gaa att gtc cac caa gac aaa tct eta gca gac 
Ala Leu Ala Lys Glu He Val His Gin Asp Lys Ser Leu Ala Asp 
1670 1675 1680 

att ttg gat cca gac tec agg ctg aag aca aca atg gac ctg atg 
He Leu Asp Pro Asp Ser Arg Leu Lys Thr Thr Met Asp Leu Met 
1685 1690 1695 

gaa ggt ttg ttt ccc cga gat gtg aac ttg ctg aag gaa aac agt 
Glu Gly Leu Phe Pro Arg Asp Val Asn Leu Leu Lys Glu Asn Ser 
1700 1705 1710 

gta aag agg aag gee ata cag aga act gtc age tct tea gga tgt 
Val Lys Arg Lys Ala He Gin Arg Thr Val Ser Ser Ser Gly Cys 
1715 1720 1725 

gaa ggc aag agg aat gaa gac aag gaa gca gtg age atg ttg gtt 
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Glu Gly Lys Arg Asn Glu Asp Lys Glu Ala Val Ser Met Leu Val 
1730 1735 1740 

aac tgc cct gcc tac tac agt gtg tct get ccc aag get gag eta 
Asn Cys Pro Ala Tyr Tyr Ser Val Ser Ala Pro Lys Ala Glu Leu 
1745 1750 1755 



6231 



ctg aac aaa ate aaa gag atg 
Leu Asn Lys lie Lys Glu Met 
1760 1765 



cca gca gaa gtg aat 
Pro Ala Glu Val Asn 
1770 



gag gaa gag 
Glu Glu Glu 



6276 



gaa cag gca gat gtc aat gaa aag aag get gag etc 
Glu Gin Ala Asp Val Asn Glu Lys Lys Ala Glu Leu 
1775 1780 1785 



att gga agt 
lie Gly Ser 



6321 



etc ace cac aag ctg gag acc 
Leu Thr His Lys Leu Glu Thr 
1790 1795 



etc cag gag gcg aag ggg age ctg 
Leu Gin Glu Ala Lys Gly Ser Leu 
1800 



6366 



etc acg gac ate aag etc aac aac gcc ctg gga gaa gag gtg gag 
Leu Thr Asp lie Lys Leu Asn Asn Ala Leu Gly Glu Glu Val Glu 
1805 1810 1815 



6411 



get ctg ate age gag etc tgc aag ccc aat gag ttt gac aag tat 
Ala Leu lie Ser Glu Leu Cys Lys Pro Ash Glu Phe Asp Lys Tyr 
1820 1825 1830 



6456 



agg atg ttc ata ggg gat ttg gac aag gtg gtc aac ctg ctg etc 6501 
Arg Met Phe He Gly Asp Leu Asp Lys Val Val Asn Leu Leu Leu 
1835 1840 1845 

tec etc teg ggg cgt eta gcc cgt gtt gag aat gtc ctt age ggc 6546 
Ser Leu Ser Gly Arg Leu Ala Arg Val Glu Asn Val Leu Ser Gly 
1850 1855 I860 

ctt ggt gaa gat gcc agt aat gaa gaa agg age tct ctt tac gag 6591 
Leu Gly Glu Asp Ala Ser Asn Glu Glu Arg Ser Ser • Leu Tyr Glu 
1865 1870 1875 



aaa agg aag ate ctg get ggt cag cat gag gat gcc egg gag ctg 
Lys Arg Lys He Leu Ala Gly Gin His Glu Asp Ala Arg Glu Leu 
1880 1885 1890 



6636 



aag gag aac ctg gat cgc agg gag cga gta gtg ctg ggc ate ttg 
Lys Glu Asn Leu Asp Arg Arg Glu Arg Val Val Leu Gly He Leu 
1895 1900 1905 



6681 



gcc aat tac ctt tea gag gag 
Ala Asn Tyr Leu Ser Glu Glu 
1910 1915 



cag etc cag gac tac cag cac ttc 
Gin Leu Gin Asp Tyr Gin His Phe 
• 1920 



6726 



gtg aaa atg aag tec acg etc etc att gag caa egg aag ctg gat 
Val Lys Met Lys Ser Thr Leu Leu He Glu Gin Arg Lys Leu Asp 
1925 1930 1935 



6771 



gac aag ate aag ctg ggc cag 
Asp Lys He Lys Leu Gly Gin 
1940 1945 



gag cag gtc aag tgt 
Glu Gin Val Lys Cys 
1950 



ctg ctg gag 
Leu Leu Glu 



6816 



age ctg ccc tea gat ttc att ccc aag. get ggg gcc 
Ser Leu Pro Ser. Asp Phe lie Pro Lys Ala Gly Ala 
' 1955 I960 1965 



ctg get ctg 
Leu Ala Leu 



6861 



ccc cca aac etc acg agt gag ' ccc att cct get ggg ggc tgt act 
Pro Pro Asn Leu Thr Ser Glu Pro lie Pro Ala Gly Gly Cys Thr 
1970 1975 1980 



6906 



ttc agt ggt att ttc cca aca tta acc tct cca ctt taacctcttc 



6952 
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Phe Ser Gly He Phe Pro Thr Leu Thr Ser Pro Leu 
1985 1990 1995 

taaaataccc aaccaaaaga tcactgtttc tctcaacact atttaatctg aaaaatgttt 7012 

cagtacaaac cactgtttga actatctggg ttattggtgt ttgttcctga tgaaaggaaa 7072 

aaaattctct ccaggaggaa gcctttttcc ttcttgccct tcctgattga tcttctgaga 7132 

gctcgaatgc tgctggacac gtaccccttt ctattattac tttgtagtag aaagaaagtt 7192 

aatgaaactg agaactgatt ggagggtgtt tgatcattta gtttttaaca ggctgaggca 7252 

acatggatca gtgtgtgtcc ccctcaggaa tgtatccaca gtggccttcc ttgctggtgg 7312 

gcagtgtatc ctgatggcag ggtacaagta ccattaatga agggtctgca acataaagcc 7372 

ttaaaaagac acacactaag aaaactgtaa aaccttgaac attgttattt atatttttta 7432 

aaatggaaaa gatcactatg tttgttgtgc taaccactta tttgattctg ttttgtggtg 7492 

gacatagatg attacgtttg agctttgtat tttgtgaaaa ccttaatgaa atgaattcca 7552 

aagat 7557 

<210> 38 

<211> 1995 

<212> PRT 

<213> NM_020859 ShrmL, Shroom-related protein 

<400> 38 

Met Arg Thr Thr Glu Asp Phe His Lys Pro Ser Ala Thr Leu Asn Ser 
1 5 10 15 

Asn Thr Ala Thr Lys Gly Arg Tyr He Tyr Leu Glu Ala Phe Leu Glu 
20 25 30 

Gly Gly Ala Pro Trp Gly Phe Thr Leu Lys Gly Gly Leu Glu His Gly 
35 40 45 

Glu Pro Leu He He Ser Lys Val Glu Glu Gly Gly Lys Ala Asp Thr 
50 55 60 

Leu Ser Ser Lys Leu Gin Ala Gly Asp Glu Val Val His He Asn Glu 
65 70 75 80 

Val Thr Leu Ser Ser Ser Arg Lys Glu Ala Val Ser Leu Val Lys Gly 
85 90 95 

Ser Tyr Lys Thr Leu Arg Leu Val Val Arg Arg Asp Val Cys Thr Asp 
100 105 110 



Pro Gly His Ala Asp Thr Gly Ala Ser Asn Phe Val Ser Pro Glu His 
115 120 125 
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Leu Thr Ser Gly Pro Gin His Arg Lys Ala Ala Trp Ser Gly Gly Val 
130 135 140 



Lys Leu Arg Leu Lys His Arg Ser Ser Glu Pro Ala Gly Arg Pro His 
145 150 155 160 



Ser Trp His Thr Thr Lys Ser Gly Glu Lys Gin Pro Asp Ala Ser Met 
165 170 175 



Met Gin lie Ser Gin Gly Met lie Gly Pro Pro Trp His Gin Ser Tyr 
180 185 190 



His Ser Ser Ser Ser Thr Ser Asp Leu Ser Asn Tyr Asp His Ala Tyr 
195 200 • . 205 



Leu Arg Arg Ser Pro Asp Gin Cys Ser Ser Gin Gly Ser Met Glu Ser 
210 215 220 



Leu Glu Pro Ser Gly Ala Tyr Pro Pro Cys His Leu Ser Pro Ala Lys 
225 230 235 240 



Ser Thr Gly Ser He Asp Gin Leu Ser His Phe His Asn Lys Arg Asp 
245 250 255 



Ser Ala Tyr Ser Ser Phe Ser Thr Ser Ser Ser He Leu Glu Tyr Pro 
260 265 270 



His Pro Gly He Ser Ala Arg Glu Arg Ser. Gly Ser Met Asp Asn Thr 
275 280 285 



Ser Ala Arg Gly Gly Leu Leu Glu Gly Met Arg Gin Ala Asp He Arg 
290 295 300 



Tyr Val Lys Thr Val Tyr Asp Thr Arg Arg Gly Val Ser Ala Glu Tyr 
305 310 315 320 



Glu Val Asn Ser Ser Ala Leu Leu Leu Gin Gly Arg Glu Ala Arg Ala 
325 330 335 



Ser Ala Asn Gly Gin Gly Tyr Asp Lys Trp Ser Asn lie Pro Arg Gly 
340 345 350 



Lys Gly Val Pro Pro Pro Ser Trp Ser Gin Gin Cys Pro Ser Ser Leu 
355 360 365 



Glu Thr Ala Thr Asp Asn Leu Pro Pro Lys Val Gly Ala Pro Leu Pro 
370 375 380 



Pro Ala Arg Ser Asp Ser Tyr Ala Ala Phe Arg His Arg Glu Arg Pro 
385 " 390 395 . 400 
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Ser Ser Trp Ser Ser Leu Asp Gin Lys Arg Leu Cys Arg Pro Gin Ala 
405 410 415 



Asn Ser Leu Gly Ser Leu Lys Ser Pro Phe lie Glu Glu Gin Leu His 
420 425 - 430 



Thr Val Leu Glu Lys Ser Pro Glu Asn Ser Pro Pro Val Lys Pro Lys 
435 440 445 



His Asn Tyr Thr Gin Lys Ala Gin Pro Gly Gin Pro Leu Leu Pro Thr 
450 " 455 4 60 



Ser lie Tyr Ala Val Pro Ser Leu Glu Pro His Phe Ala Gin Val Pro 
465 470 475 480 



Gin Pro Ser Val Ser Ser Asn Gly Met Leu Tyr Pro Ala Leu Ala Lys 
485 490 495 



Glu Ser Gly Tyr He Ala Pro Gin Gly Ala Cys Asn Lys Met Ala Thr 
500 505 510 



He Asp Glu Asn Gly Asn Gin Asn Gly Ser Gly Arg Pro Gly Phe Ala 
515 520 525 



Phe Cys Gin Pro Leu Glu His Asp Leu Leu Ser Pro Val Glu Lys Lys 
. 530 535 540 



Pro Glu Ala Thr Ala Lys Tyr Val Pro Ser Lys Val His Phe Cys Ser 
545 550 555 560 



Val Pro Glu Asn Glu Glu Asp Ala Ser Leu Lys Arg His Leu Thr Pro 
565 570 575 



Pro Gin Gly Asn Ser Pro His Ser Asn Glu Arg Lys Ser Thr His Ser 
580 585 590 



Asn Lys Pro Ser Ser His Pro His Ser Leu Lys Cys Pro Gin Ala Gin 
595 600 605 



Ala Trp Gin Ala Gly Glu Asp Lys Arg Ser Ser Arg Leu Ser Glu Pro 
610 "* 615 620 



Trp Glu Gly Asp Phe Gin Glu Asp His Asn Ala Asn Leu Trp Arg Arg 
625 630 635 640 



Leu Glu Arg Glu Gly Leu Gly Gin Ser Leu Ser Gly Asn Phe Gly Lys 
645 ~ 650 655 



Thr Lys Ser Ala Phe Ser Ser Leu Gin Asn .He Pro Glu Ser Leu Arg 
660 665 670 
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Arg His Ser Ser Leu Glu Leu Gly Arg Gly Thr Gin Glu Gly Tyr Pro 
675 680 " 685 



Gly Gly Arg Pro Thr Cys Ala Val Asn Thr Lys Ala Glu Asp Pro Gly 
690 695 " 700 



Arg Lys Ala Ala Pro Asp Leu Gly Ser His Leu Asp Arg Gin Val Ser 
705 710 715 720 



Tyr Pro Arg Pro Glu Gly Arg Thr Gly Ala Ser Ala Ser Phe Asn Ser 
725 730 735 



Thr Asp Pro Ser Pro Glu Glu Pro Pro Ala Pro Ser His Pro His Thr 
740 745 • 750 



Ser Ser Leu Gly Arg Arg Gly Pro Gly Pro Gly Ser Ala Ser Ala Leu 
755 . 760 765 



Gin Gly Phe Gin Tyr Gly Lys Pro His Cys Ser Val Leu Glu Lys Val 
770 775 780 



Ser Lys Phe Glu Gin Arg Glu Gin Gly Ser Gin Arg Pro Ser Val Gly 
785 790 795 800 



Gly Ser Gly Phe Gly His Asn Tyr Arg Pro His Arg Thr Val Ser Thr 
805 810 815 



Ser Ser Thr Ser Gly Asn Asp Phe Glu Glu Thr Lys Ala His lie Arg 
820 825 830 



Phe Ser Glu Ser Ala Glu Pro Leu Gly Asn Gly Glu Gin His Phe Lys 
835 840 845 



Asn Gly Glu Leu Lys Leu Glu Glu Ala Ser Arg Gin Pro Cys Gly Gin 
850 855 860 



Gin Leu Ser Gly Gly Ala Ser Asp Ser Gly Arg Gly Pro Gin Arg Pro 
865 870 875 880 



Asp Ala Arg Leu Leu Arg Ser Gin Ser Thr Phe Gin Leu Ser Ser Glu 
885 890 895 



Pro Glu Arg Glu Pro Glu Trp Arg Asp Arg Pro Gly Ser Pro Glu Ser 
900 905 910 



Pro Leu Leu Asp Ala Pro Phe Ser Arg Ala Tyr Arg Asn Ser He Lys 
915 920 925 



Asp Ala Gin Ser Arg Val Leu Gly Ala Thr Ser Phe Arg Arg Arg Asp 
. 930 935 940 
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Leu Glu Leu Gly Ala Pro Val Ala Ser Arg Ser Trp Arg Pro Arg Pro 
945 950 955 960 



Ser Ser Ala His Val Gly Leu Arg Ser Pro Glu Ala Ser Ala Ser Ala 
965 970 975 



Ser Pro His Thr Pro Arg Glu Arg His Ser Val Thr Pro Ala Glu Gly 
980 985 990 



Asp Leu Ala Arg Pro Val Pro Pro Ala Ala Arg Arg Gly Ala Arg Arg 
995 1000 1005 



Arg Leu Thr Pro Glu Gin Lys Lys Arg Ser Tyr Ser Glu Pro Glu 
1010 1015 1020 



Lys Met Asn Glu Val Gly He Val Glu Glu Ala Glu Pro Ala Pro 
1025 1030 1035 



Leu Gly Pro Gin Arg Asn Gly Met Arg Phe Pro Glu Ser Ser Val 
1040 1045 1050 



Ala Asp Arg Arg Arg Leu Phe Glu Arg Asp Gly Lys Ala Cys Ser 
1055 1060 1065 



Thr Leu Ser Leu Ser Gly Pro. Glu Leu Lys Gin Phe Gin Gin Ser 
1070 1075 1080 



Ala Leu Ala Asp Tyr He Gin Arg Lys Thr Gly Lys Arg Pro Thr 
1085 1090 1095 



Ser Ala Ala Gly Cys Ser Leu Gin Glu Pro Gly Pro Leu Arg Glu 
1100 1105 1110 



Arg Ala Gin Ser Ala Tyr Leu Gin Pro Gly Pro Ala Ala Leu Glu 
1115 1120 1125 



Gly Ser Gly Leu Ala Ser Ala Ser Ser Leu Ser Ser Leu Arg Glu 
1130 1135 1140 



Pro Ser Leu Gin Pro Arg Arg Glu Ala Thr Leu" Leu Pro Ala Thr 
1145 1150 1155 



Val Ala Glu Thr Gin Gin Ala Pro Arg Asp Arg Ser Ser Ser Phe 
1160 1165 1170 



Ala Gly Gly Arg Arg Leu Gly Glu Arg Arg Arg Gly Asp Leu Leu 
1175 1180 1185 



Ser Gly Ala Asn Gly Gly Thr Arg Gly Thr Gin Arg Gly Asp Glu 
1190 1195 1200 
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Thr Pro Arg Glu Pro Ser Ser Trp Gly Ala Arg Ala Gly Lys Ser 
1205 1210' 1215 



Met Ser Ala Glu Asp Leu Leu Glu Arg Ser Asp Val Leu Ala Gly 
1220 1225 " 1230 



Pro Val His Val Arg Ser Arg Ser Ser Pro Ala Thr Ala Asp Lys 
1235 1240 1245 

Arg Gin Asp Val Leu Leu Gly Gin Asp Ser Gly Phe Gly Leu Val 
1250 1255 1260 



Lys Asp Pro Cys Tyr Leu Ala Gly Pro Gly Ser Arg Ser Leu Ser 
1265 1270 1275 

Cys Ser Glu Arg Gly Gin Glu Glu Met Leu Leu Leu Phe His His 
. 1280 " 1285 1290 



Leu Thr Pro Arg Trp Gly Gly Ser Gly Cys Lys Ala lie Gly Asp 
1295 1300 . 1305 



Ser Ser Val Pro Ser Glu Cys Pro Gly Thr Leu Asp His Gin Arg 
1310 1315 1320 



Gin Ala Ser Arg Thr Pro Cys Pro Arg Pro Pro Leu Ala Gly Thr 
1325 " 1330 1335 



Gin Gly Leu Val Thr Asp Thr Arg Ala Ala Pro Leu Thr Pro He 
1340 1345 1350 



Gly Thr Pro Leu Pro Ser Ala He Pro Ser Gly Tyr Cys Ser Gin 
1355 1360 1365 



Asp Gly Gin Thr Gly Arg Gin Pro Leu Pro Pro Tyr Thr Pro Ala 
1370 1375 1380 



Met Met His Arg Ser Asn Gly His Thr Leu Thr Gin Pro Pro Gly 
1385 1390 1395 



Pro Arg Gly Cys Glu Gly Asp Gly Pro Glu His Gly Val Glu Glu 
1400 1405 1410 



Gly Thr Arg Lys Arg Val Ser Leu Pro Gin Trp Pro Pro Pro Ser 
1415 1420 1425 



Arg Ala Lys Trp Ala His Ala Ala Arg Glu Asp Ser Leu Pro Glu 
1430 " 1435 1440 



Glu Ser Ser Ala Pro Asp Phe Ala Asn Leu Lys His Tyr Gin Lys 
1445 1450 1455 
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Gin Gin Ser Leu Pro Ser Leu Cys Ser Thr Ser Asp Pro Asp Thr 
1460 1465 * 1470 



Pro Leu Gly Ala Pro Ser Thr Pro Gly Arg lie Ser Leu Arg lie 
1475 1480- * 1485 



Ser Glu Ser Val Leu Arg Asp Ser Pro Pro Pro His Glu Asp Tyr 
1490 1495 1500 



Glu Asp Glu Val Phe Val Arg Asp Pro His Pro Lys Ala Thr Ser 
1505 1510 1515 



Ser Pro Thr Phe Glu Pro Leu Pro Pro Pro Pro Pro Pro Pro Pro 
1520 1525 1530 



Ser Gin Glu Thr Pro Val Tyr Ser Met Asp Asp Phe Pro Pro Pro 
1535 1540 ■ 1545 



Pro Pro His Thr Val Cys Glu Ala Gin Leu Asp Ser Glu Asp Pro 
1550 1555 1560 



Glu Gly Pro Arg Pro Ser Phe Asn Lys Leu Ser Lys Val Thr lie 
1565 1570 1575 



Ala Arg Glu Arg His Met Pro Gly Ala Ala His Val Val Gly Ser 
1580 1585 1590 



Gin Thr Leu Ala Ser Arg Leu Gin Thr Ser lie Lys Gly Ser Glu 
1595 1600 1605 



Ala Glu Ser Thr Pro Pro Ser Phe Met Ser Val His Ala Gin Leu 
1610 1615 1620 



Ala Gly Ser Leu Gly Gly Gin Pro Ala Pro lie Gin Thr Gin Ser 
1625 1630 1635 



Leu Ser His Asp Pro Val Ser Gly Thr Gin Gly Leu Glu Lys Lys 
1640 1645 1650 



Val Ser Pro Asp Pro Gin Lys Ser Ser Glu Asp lie Arg Thr Glu 
1655 1660 1665 



Ala Leu Ala Lys Glu lie Val His Gin Asp Lys Ser Leu Ala Asp 
1670 1675 1680 



lie Leu Asp Pro Asp Ser Arg Leu Lys Thr Thr Met Asp Leu Met 
1685 1690 1695 



Glu Gly Leu Phe Pro Arg Asp Val Asn Leu Leu Lys Glu Asn Ser 
1700 1705 1710 
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Val Lys Arg Lys Ala lie Gin Arg Thr Val Ser Ser Ser Gly Cys 
1715 1720 1725 



Glu Gly Lys Arg Asn Glu Asp Lys Glu Ala Val Ser Met Leu Val 
1730 1735 1740 



Asn Cys Pro Ala Tyr Tyr Ser Val Ser Ala Pro Lys Ala Glu Leu 
1745 1750 1755 



Leu Asn Lys lie Lys Glu Met Pro Ala Glu Val Asn Glu Glu Glu 
1760 1765 1770 



Glu Gin Ala Asp Val Asn Glu Lys Lys Ala Glu Leu lie Gly Ser 
1775 1780 1785 



Leu Thr His Lys Leu Glu Thr Leu Gin Glu Ala Lys Gly Ser Leu 
1790 1795 1800 



Leu Thr Asp lie Lys Leu Asn Asn Ala Leu Gly Glu Glu Val Glu 
1805 1810 1815 



Ala Leu lie Ser Glu Leu Cys Lys Pro Asn Glu Phe Asp Lys Tyr 
1820 1825 1830 



Arg Met Phe He Gly Asp Leu Asp Lys Val Val Asn Leu Leu Leu 
1835 1840 1845 



Ser Leu Ser Gly Arg Leu Ala Arg Val Glu Asn Val Leu Ser Gly 
1850 1855 1860 



Leu Gly Glu Asp Ala Ser Asn Glu Glu Arg Ser Ser Leu Tyr Glu 
1865 1870 1875 



Lys Arg Lys He Leu Ala Gly Gin His Glu Asp Ala Arg Glu Leu 
1880 1885 1890 



Lys Glu Asn Leu Asp Arg Arg Glu Arg Val Val Leu Gly He Leu 
1895 1900 1905 



Ala Asn Tyr Leu Ser Glu Glu Gin Leu Gin Asp Tyr Gin His Phe 
1910 1915 1920 



Val Lys Met Lys Ser Thr Leu Leu He Glu Gin Arg Lys Leu Asp 
1925 1930 1935 



Asp Lys He Lys Leu Gly Gin Glu Glh Val Lys Cys Leu Leu Glu 
1940 1945 1950 



Ser Leu Pro Ser Asp Phe He Pro Lys Ala Gly Ala Leu Ala Leu 
1955 I960 1965 
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Pro Pro Asn Leu Thr Ser Glu Pro He Pro Ala Gly Gly Cys Thr 
1970 1975 1980 

Phe Ser Gly He Phe Pro Thr Leu Thr Ser Pro Leu 
1985 1990 1995 

<210> 39 

<211> 2529 

<212> DNA . . 

<213> NMJ)04433 ELF3, E74-like factor 3 



<220> 

<221> CDS 

<222> <203) . . (1315) 

<223> 



<400> 39 

gagagagaga gagagagaga actagtctcg aggtcgacgc ggccgcgaat tcgcggccgc 60 

gtcgaccgaa gacgcaggct ctatttagag ccgggtaggg gagcgcacgg ccagatacct 120 

cagcgctacc tggcggaact ggatttctct cccgcctgcc ggcctgcctg ccacagccgg 180 

actccgccac tccggtagcc tc atg get gca acc tgt gag att age aac att 232 

Met Ala Ala Thr Cys Glu He Ser Asn He 
15 10 

ttt age aac tac ttc agt gcg atg tac age teg gag gac tec acc ctg 280 
Phe Ser Asn Tyr Phe Ser Ala Met Tyr Ser Ser Glu Asp Ser Thr Leu 
15 20 25 

gee tct gtt ccc cct get gec acq ttt ggg gee gat gac ttg gta ctg 328 
Ala Ser Val Pro Pro Ala Ala Thr Phe Gly Ala Asp Asp Leu Val Leu 
30 35 40 

acc ctg age aac ccc cag atg tea ttg gag ggt aca gag aag gee age 37 6 

Thr Leu Ser Asn Pro Gin Met Ser Leu Glu Gly Thr Glu Lys Ala Ser 
45 50 55 

tgg ttg ggg gaa cag ccc cag ttc tgg teg aag acg cag gtt ctg gac 424 
Trp Leu Gly Glu Gin Pro Gin Phe Trp Ser Lys Thr Gin Val Leu Asp 
60 65 70 

tgg ate age tac caa gtg gag aag aac aag tac gac gca age gee att 472 
Trp He Ser Tyr Gin Val Glu Lys Asn Lys Tyr Asp Ala Ser Ala He 
75 80 85 90 

gac ttc tea cga tgt gac atg gat ggc gee acc etc tgc aat tgt gee 520 
Asp Phe Ser Arg Cys Asp Met Asp Gly Ala Thr Leu Cys Asn Cys Ala 
95 - 100 105 

ctt gag gag ctg cgt ctg gtc ttt ggg cct ctg. ggg gac caa etc cat 568 
Leu Glu Glu Leu Arg Leu Val Phe Gly Pro Leu Gly Asp Gin Leu His 
110 115 120 
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gcc cag ctg cga gac etc act tec age tct tct gat gag etc agt tgg 
Ala Gin Leu Arg Asp Leu Thr Ser Ser Ser Ser Asp Glu Leu Ser Trp 
125 130 135 



616 



ate att gag ctg ctg gag aag gat ggc atg gcc ttc cag gag gcc eta 664 
lie lie Glu Leu Leu Glu Lys Asp Gly Met Ala Phe Gin Glu Ala Leu 
140 145 150 

gac cca ggg ccc ttt gac cag ggc age ccc ttt gcc cag gag ctg ctg 712 
Asp Pro Gly Pro Phe Asp Gin Gly Ser Pro Phe Ala Gin Glu Leu Leu 
155 ~ 160 165 170 

gac gac ggt cag caa gcc age ccc tac cac ccc ggc age tgt ggc gca 760 
Asp Asp Gly Gin Gin Ala Ser Pro Tyr His Pro Gly Ser Cys Gly Ala 
175 180 185 

gga gcc ccc tec ccc ggc age tct gac gtc tec acc gca ggg act ggt 808 
Gly Ala Pro Ser Pro Gly Ser Ser Asp Val Ser Thr Ala Gly Thr Gly 
190 195.' 200. 

get tct egg age tec cac tec tea gac tec ggt gga agt gac gtg gac 856 
Ala Ser Arg Ser Ser His Ser Ser Asp Ser Gly Gly Ser Asp Val Asp 
205 210 215 

ctg gat ccc act gat ggc aag etc ttc ccc age gat ggt ttt cgt gac 904 
Leu Asp Pro Thr Asp Gly Lys Leu Phe Pro Ser Asp Gly Phe Arg Asp 
220 225 230 

tgc aag aag ggg gat ccc aag cac ggg aag egg aaa cga ggc egg ccc 952 
Cys Lys Lys Gly Asp Pro Lys His Gly Lys Arg Lys Arg Gly Arg Pro 
235 240 245 250 

cga aag ctg age aaa gag tac tgg gac tgt etc gag ggc aag aag age 1000 
Arg Lys Leu Ser Lys Glu Tyr Trp Asp Cys Leu Glu Gly Lys Lys Ser 
255 260 265 

aag cac gcg ccc aga ggc acc cac ctg tgg gag ttc ate egg gac ate 1048 
Lys His Ala Pro Arg Gly Thr His Leu Trp Glu Phe lie Arg Asp lie 
270 275 280 

etc ate cac ccg gag etc aac gag ggc etc atg aag tgg gag aat egg 1096 
Leu lie His Pro Glu Leu Asn Glu Gly Leu Met Lys Trp Glu Asn Arg 
285 290 295 

cat gaa ggc gtc ttc aag ttc ctg cgc tec gag get gtg gcc caa eta 1144 
His Glu Gly Val Phe Lys Phe Leu Arg Ser Glu Ala Val Ala Gin Leu 
300 305 310 

tgg ggc caa aag aaa aag aac age aac atg acc tac gag aag ctg age 1192 
Trp Gly Gin Lys Lys Lys Asn Ser Asn Met Thr Tyr Glu Lys Leu Ser 
315 320 325 330 

egg gcc atg agg tac tac tac aaa egg gag ate ctg gaa egg gtg gat 1240 
Arg Ala Met Arg Tyr Tyr Tyr Lys Arg Glu lie Leu Glu Arg Val Asp 
335 340 345 

ggc egg cga etc gtc tac aag ttt ggc aaa aac tea age ggc tgg aag 1288 
Gly Arg Arg Leu Val Tyr Lys Phe Gly Lys Asn Ser Ser Gly Trp Lys 
350 355 360 

gag gaa gag gtt etc cag agt egg aac tgagggttgg aactataccc 1335 
Glu Glu Glu Val Leu Gin Ser Arg Asn 
365 370 

gggaccaaac tcacggacca ctcgaggcct gcaaaccttc ctgggaggac aggcaggeca 1395 

gatggcccct ccactgggga atgctcccag ctgtgctgtg gagagaagct gatgttttgg 1455 
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tgtattgtca 


gccatcgtcc 


ttqqactcqq 


agactatggc ctcgcctccc 


caccctcctc 


1515 


ttggaattac 


aaqccctqqq 

3 w 3 3 3 


gtttgaagct 


gactttatag ctgcaagtgt 


atctcctttt 


1575 


atctggtgcc 


tcctcaaacc 


■cagtctcaga 


cactaaatgc agacaacacc 


ttcctcctgc 


1635 


agacacttgg 


actgagccaa 


qqaqacttaa 


cracrcrppctaq qaaacaccqt 


qatqqaqaqq 


1695 


acacraocaaa 


ggctccagca 


cttctttctg 


gactggcgtfc cacctcccfcg 


etcagtgett 


1755 


qqqctccaca 


qac aacroo t c 


agagcactcc 


rtaafrttato tactatataa 


atatgtcaga 


1815 


tgtacataga 


gatctatttt 


ttctaaaaca 


ttcrrctccc cactcctctc 


ccacagagtg ' 


1875 


ctoaactott 


ccagcjccctc 


cacrtaacrctcr 


a t or 1 1 era crap {""fttacrcfata 


gggctcccag 


1935 


"ctcctttctc 


ctgtgaatgg 


aaocaoacra c 


ctccaataaa gtgccttctg 


ggctttttct 


1995 


aacctttgtc 


ttagctacct 


atatactcraa 

v»^* wu\* wy 


atttooacct tt aerate era a 
a l» *"*y y y ** ** j y 3 


tataatcaaa 


2055 


aqqttqqaqq 

" 3 v " **3 3*"*3 3 


qqaqqaaaat 

I? J** 33 ****** *» 


gaaggtctac 


caggatgagg gtgagggcaa 


aaactqacqa 
u yy vi -y uv y u 


2115 


aqaqqqgaqt 


tacagatttc 


ctataqcaqq 


totooactta caoacacatq 


qactqqqctq 

3 t * v * fc "333 vw 3 


2175 


ggaggcgagc 


aaaggaagca 


gctgagactg 


ttggagaacg ttacaagact 


teatgeaaga 


2235 


aggacatgaa 


ctcagaacac 


tgaggtcaga 


ageatctget gtcatgacac 


cgctcgagtg 


2295 


accttgacct 


tgaccaagtc 


ttgtccttgt 


ttaggactga tttttcctat 


taggctaggg 


2355 


tttggacctg 


atgttctcaa 


gatgtctaga 


attgeatgge tggccttgtg 


gaatagatgg 


2415 


ttttgcattc 


cagccaagtg 


tgctgtaaac 


tgtatatctg taatatgaat 


cccagctttt 


2475 


gagtctgaca 


aaatcagagt 


taggatcttg 


taaaggaaaa aaaaaaaaaa 


aaaa 


2529 



<210> 40 

<211> 371 

<212> PRT 

<213> NMJJ04433 ELF3, E74-like factor 3 

<400> 40 

Met Ala Ala Thr Cys Glu lie Ser Asn lie Phe Ser Asn Tyr Phe Ser 
1 5 10' 15 



Ala Met Tyr Ser Ser Glu Asp Ser Thr Leu Ala Ser Val Pro Pro Ala 
20 25 30 



Ala Thr Phe Gly Ala Asp Asp Leu Val Leu Thr Leu Ser Asn Pro Gin 
35 40 45 



Met Ser Leu Glu Gly Thr Glu Lys Ala Ser Trp Leu Gly Glu Gin Pro 
50 55 * 60 



Gin Phe Trp Ser Lys Thr Gin Val Leu Asp Trp lie Ser Tyr Gin Val 
65 70 75 80 



WO 2004/022778 PCT/AU2003/001 166 

112 



Glu Lys Asn Lys Tyr Asp Ala Ser Ala lie Asp Phe Ser Arg Cys Asp 
85 90 95 



Met Asp Gly Ala Thr Leu Cys Asn Cys Ala Leu Glu Glu Leu Arg Leu 
100 105 110 



Val Phe Gly Pro Leu Gly Asp Gin Leu His Ala Gin Leu Arg Asp Leu 
115 120 125 



Thr Ser Ser Ser Ser Asp Glu Leu Ser Trp He He Glu Leu Leu Glu 
130 135 140 



Lys Asp Gly Met Ala Phe Gin Glu Ala Leu Asp Pro Gly Pro Phe Asp 
145 150 155 160 



Gin Gly Ser Pro Phe Ala Gin Glu Leu Leu Asp Asp Gly Gin Gin Ala 
165 170 175 



Ser Pro Tyr His Pro Gly Ser Cys Gly Ala Gly Ala Pro Ser Pro Gly 
180 185 190 



Ser Ser Asp Val Ser Thr Ala Gly Thr Gly Ala Ser Arg Ser Ser His 
195 200 205 



Ser Ser Asp Ser Gly Gly Ser Asp Val Asp Leu Asp Pro. Thr Asp Gly 
210 215 220 



Lys Leu Phe Pro Ser Asp Gly Phe Arg Asp Cys Lys Lys Gly Asp Pro 
225 230 " 235 240 



Lys His Gly Lys Arg Lys Arg Gly Arg Pro Arg Lys Leu Ser Lys Glu 
245 250 . 255 



Tyr Trp Asp Cys Leu Glu Gly Lys Lys Ser Lys His Ala Pro Arg Gly 
260 265 270 



Thr His Leu Trp Glu Phe He Arg Asp .He Leu He His Pro Glu Leu 
275 280 285 



Asn Glu Gly Leu Met Lys Trp Glu Asn Arg His Glu Gly Val Phe Lys 
290 295 300 



Phe Leu Arg Ser Glu Ala Val Ala Gin Leu Trp Gly Gin Lys Lys Lys 
305 310 315 320 



Asn Ser Asn Met Thr Tyr Glu Lys Leu Ser Arg Ala Met Arg Tyr Tyr 
325 330 335 



Tyr Lys Arg Glu He Leu Glu Arg Val Asp Gly Arg Arg Leu Val Tyr 
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340 ' 345 350 



Lys Phe Gly Lys Asn Ser Ser Gly Trp Lys Glu Glu Glu Val Leu Gin 
355 . 360 365 



Ser Arg Asn 
370 



<210> 41 

<211> 4020 

<212> DNA 

<213> NM_019027 FLJ20273 
<220> 

<221> CDS 

<222> (240) . . (1811) 

<223> 



<40O> 41 

gtcaattgcc ctttagtccc aggactaacc ggaagcttct gcaacaggag gacattgaaa 60 

ataagatgga acccatccac ataaggattt gcctcaaagg gcactgcaaa aattgaacag 120 

aggaatccca aggaagctgc ctgaatttgc ctgtatactc tcgttctgcg acttataaag 180 

gaccagacaa atcaaattag tggttttggt ttccgccagc tgtggatgcc tttgacatt 239 

atg acc gca gag gat tec acc gca gec atg age agt gac teg gec gec 287 
Met Thr Ala Glu Asp Ser Thr Ala Ala Met Ser Ser Asp Ser Ala Ala 
15 10 15 

ggg tec teg gec aag gtg ccc gag ggc gtg gcg ggc gcg ccc aac gag 335 
.Gly Ser Ser Ala Lys Val Pro Glu Gly Val Ala Gly Ala Pro Asn Glu 
20 25 30 

gca gca ctg ctg gcg ctg atg gag cgc acg ggc tac age atg gtg caa 383 
Ala Ala Leu Leu Ala Leu Met Glu Arg Thr Gly Tyr Ser Met Val Gin 
35 40 45 

gag aac ggg cag cgc aag tac ggc ggc cca ccg ccc ggc tgg gag ggc 431 
Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Glu Gly 
50 55 60 

ccg cac ccg cag cgt ggc tgc gag gtc ttc gtg ggc aag ate ccg cgc 479 
Pro His Pro Gin Arg Gly Cys Glu Val Phe Val Gly Lys lie Pro Arg 
65 70 75 80 

gac gtg tac gag gac gag ctg gtg ccc gtg ttc gag gee gtg ggc cgc 527 
Asp Val Tyr Glu Asp Glu Leu Val Pro Val Phe Glu Ala Val Gly Arg 
85 90 95 

acc tac gag ctg cgc etc atg atg gac ttt gac ggc aag aac cgc ggc 575 
Thr Tyr Glu Leu Arg Leu Met Met Asp Phe Asp Gly Lys Asn Arg Gly 
100 105 110 
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tac gcc ttc gtc atg tac tgc cac aag cac gag gcc aag cgc gca gtg 623 
Tyr Ala Phe Val Met Tyr Cys His Lys His' Glu Ala Lys Arg Ala Val 
115 120 125 

cgt gag etc aac aac tac gag ate cgc ccg ggc cgc ctg etc ggc gtg 671 
Arg Glu Leu Asn Asn Tyr Glu lie Arg Pro Gly Arg Leu Leu Gly Val 
130 135 140 

tgc tgc age gtg gac aac tgc cgc etc ttc ate ggc ggg ate ccc aag 719 
Cys Cys Ser Val Asp Asn Cys Arg Leu Phe He Gly Gly He Pro Lys 
145 150 155 160 

atg aag aag cgc gag gaa ate ctg gag gag att gcc aag gtc acc gag 767 
Met Lys Lys Arg Glu Glu He Leu Glu Glu He Ala Lys Val Thr Glu 
165 170 175 

ggc gtg ctg gac gtg ate gtc tac gcc age gcg gcc gac aag atg aag 815 
Gly Val Leu Asp Val He Val Tyr Ala Ser Ala Ala Asp Lys Met Lys 
180 185 190 

aac cgc ggc ttc gcc ttc gtg gag tac gag age cac cgc gcg get gcc 863 
Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala Ala 
195 200 205 

atg get cgc cgc aag etc atg cct ggc cgc ate cag ctg tgg ggc cac 911 
Met Ala Arg Arg Lys Leu Met Pro Gly Arg He Glh Leu Trp Gly His 
210 215 220 

cag ate gcc gtg gac tgg gcc gag cct gag ate gac gtg gac gag gac 959 
Gin He Ala Val Asp Trp Ala Glu Pro Glu He Asp Val Asp Glu Asp 
225 230 235 240 

gtg atg gag acc gtg aag ate etc tac .gtg cgc aac etc atg ate gag 1007 
Val Met Glu Thr Val Lys He Leu Tyr Val Arg Asn Leu Met He Glu 
245 250 " 255 

acc acc gag gac acc ate aag aag age ttc ggc cag ttc aac ccc ggc 1055 
Thr Thr Glu Asp Thr He Lys Lys Ser Phe Gly Gin Phe Asn Pro Gly 
260 265 270 

tgc gtg gag cgc gtc aag aag ate cgc gac tac gcc ttc gtg cac ttc 1103 
Cys Val Glu Arg Val Lys Lys He Arg Asp Tyr Ala Phe Val His Phe 
275 280 285 

acc age cgc gag gat gcc gtg cat gcc atg aac aac etc aac ggc act 1151 
Thr Ser Arg Glu Asp Ala Val His Ala Met Asn Asn Leu Asn Gly Thr 
• 290 295 300 

gag ctg gag ggc teg tgc ctg gag gtc acg ctg gcc aag ccc gtg gac 1199 
Glu Leu Glu Gly Ser Cys Leu Glu Val Thr Leu Ala Lys Pro Val Asp 
305 310 315 320 

aag gag cag tac teg cgc tac cag aag gca gcc agg ggc ggc ggc gcg 1247 
Lys Glu Gin Tyr Ser Arg Tyr Gin Lys Ala Ala Arg Gly Gly Gly Ala 
325 330 335 

get gag gca gcg cag cag ccc age tac gtg tac tec tgc gac ccc tac 1295 
Ala Glu Ala Ala Gin Gin Pro Ser Tyr Val Tyr Ser Cys Asp Pro Tyr 
340 345 " 350 

aca ctg gcc tac tac ggc tac ccc tac aac gcg etc att ggg ccc aac 1343 
Thr Leu Ala Tyr Tyr Gly Tyr Pro Tyr Asn Ala Leu He Gly Pro Asn 
355 360 365 

agg gac tac ttt gtg aaa gta gcc ate cct gcc att ggg get cag tat 1391 
Arg Asp Tyr Phe Val Lys Val Ala He Pro Ala He Gly Ala Gin Tyr 
370 375 380 
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tec atg ttt cca .gca get cca gee cct aaa atg att gaagat ggc aaa 1439 
Ser Met Phe Pro Ala Ala Pro Ala Pro Lys Met lie Glu Asp Gly Lys 
385 390 395 * 400 

ate cac aca gtg gag cac atg ate age ccc att get gtg cag cca gac 1487 
He His Thr Val Glu His Met He Ser Pro He Ala Val Gin Pro Asp 
405 410 415 

cca gec agt get get gee gee gca gee gcg gee gca gee gee gca gee 1535 
Pro Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
420 425 430 

get gtc att ccc act gtg teg acg cca cca cct ttc xag ggc cgc cca 1583 
Ala Val He Pro Thr Val Ser Thr Pro Pro Pro Phe Gin Gly Arg Pro 
435 440 445 

ata act cca gta tac acg gtg get cca aac gtt cag aga att cct act 1631 
He Thr Pro Val Tyr Thr Val Ala Pro Asn Val Gin Arg lie Pro Thr 
450 455 460 

gee ggg ate tac ggg gee agt tac gtg cca ttt get get cca get aca 1679 
Ala Gly He Tyr Gly Ala Ser Tyr Val Pro Phe Ala Ala Pro Ala Thr 
465 470 475 480 

gee acg ate gee aca eta cag aag aac gcg gca gee gcg gee gee gtg 1727 
Ala Thr He Ala Thr Leu Gin Lys Asn Ala Ala Ala Ala Ala Ala Val- 
485 490 495 

tat gga gga tac gca ggc tac ata cct cag gee ttc cct get get gee 1775 
Tyr Gly Gly Tyr Ala Gly Tyr He Pro .Gin Ala Phe Pro Ala Ala Ala 
500 505 ■ 510 

att cag gtc ccc ate ccc gac gtc tac cag aca tac tgaggctggt 1821 
lie Gin Val Pro lie Pro Asp Val Tyr Gin Thr Tyr 
515 520 

gaccagcacg aagacagacc acacaaacac cactgaagga aegcttgact atttatgaag 1881 

aaggaacatg ttggattcac acatgcaacc tgaaagtgaa gaatgttagc agatttattt 1941 

ctgaattatt ttatatacat gaagttttca ctagtttttt aagactattt tcaacttagc 2001 

atgcctacgt tcatacattt ccaaaagact tgcaatggtt cgtgccttca ttccatcttt 2061 

taaaaatttg tatgetgtae tacatttgta tagaggtttt tgttgttgtt tttttaagga 2121 

tatattttca gtatgaaggt tattttctta acttctgcac tccagagatt tctattttgt 2181 

agtaccttca ataatatatc aactatatat taaaaaagca cacttgagga gctagggaac 2241 

tattttgaaa aatatataca atatttaaag atacaaacag tagtgcttaa aaatactaca 2301 

taaagcatta ttttaaaggt tatactggaa agtgcaattt taaaatgagt aaaacctctg 2361 

tatttctget ggcattaagg gttgatggtg ttaccatgta tcatcatggc ggtactattt 2421 

tttaaaagaa attaaacact ggatctctcc ttaagecaac attgaaaaga cttgccgcac 2481 

ttctgagtcc aaacactgga aagctctcct tgccaccgtt ageegggget cattctccat 2541 

gtgecttage cttaaacatg cccccactcc cacatctctc accctgtccc ctcctcccca 2601 

gattcccaat cccaccgcaa tgtttggcaa gectaggact gataagtagc tctgatagag 2661 

gagctggtgg cttttatact tcttcctggg tttttgttgg ggtttgttgt ttcgttgttt 2721 

tttgtttttt ttttgtttgg ttggggaagt attgtcttct acgtgtgcta ttttcagtag 2781 
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cagagtaagc acaaggtttt aatcgagttg cataagacac ctttgcatag ctatttaatt 2841 

gcccaatgta aaactttaat gccatttcta atgcttttat tcatttttga agtatgagtt 2901 

tgtagggaca aagaatgtat gttatcgtag acaagacccc cagagactct tttcagcaga 2961 

aagttatgct tctagttgcc ttaccatgtt tcttgcaaaa ctgtccatgg tcctcaaggg 3021 

tgttggaaac attatgttta ttaaatgggc ctctcttcct ttgctgtgca cttgatgggt 3081 

gaactggatt ggggtgtgca catccaggag gaggaggaga gacctgtaga agtttaaaga 3141 

tagtttgtaa atatcttcta atgcttgttt ttagtccttt tatgttggag aagttcatgg 3201 

tatgtagttt aatgcaaaat gaaaccattt tatttcaatg ttattaaaaa ggtttgtttt 3261 

attaggaagt taatgtattg ttgcagtgtt ttgtgcctgt ttaaaggctt ttgtttagca 3321 

gagtgaatgt aaaatacagt aaaatgttaa gattgtcatc tactttttaa aaaaaaatat 3381 

caacttggaa ttgtttttta aaggctcaat caaggaagtg aggtgtgcaa taaggtagca 3441 

agtaaaacgc agttgcgttt ttatgtcatg ttagagatcc atacaatttt ccactcacgg 3501 

gatttttgtt gatggctgaa ttcttgtgga ttcataagag gatcatgccc ttagcaagta 3561 

cttttgtttt gttttaaatt aagagattcc caaatgcctt tttccccctc atcttgaaat 3621 

gagatgagtt tttatgtgta agcaatattt atttaactat tctataaaat tattgagtgc 3681 

ctactgaggc ctttaagcac cgctaacatt cctttccatc attcttttga atgacataaa 3741 

ataattgtgc aatgttcctg atgatgtacc ccacagctgc attcaaactc aaatctgtgg 3801 

gaatgagtga ctcgaccaaa tgtaattcgg atcagatcct catcccctga ctgtgtgaaa 3861 

aaagtactct ccttctagtg aaggattgtc acagagtttc actggatgaa actatgaccc 3921 

agtattctta ctgtatttta catgtgcctg taaattattt tgccgaaata agaagaagaa 3981 

gaggaagaaa gaacagtaga aaaaaaaaat aaaaaaaaa 4020 

<210> 42 
<211> 524 
<212> PRT 

<213>- NM_019027 FLJ20273 
<400> 42 

Met Thr Ala Glu Asp Ser Thr Ala Ala Met Ser Ser Asp Ser Ala Ala 
15 10 15 

Gly Ser Ser Ala Lys Val Pro Glu Gly Val Ala Gly. Ala Pro Asn Glu 
20 25 30 

Ala Ala Leu Leu Ala Leu Met Glu Arg Thr Gly Tyr Ser Met Val Gin 
35 40 45 

Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Glu Gly 
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50 55 60 

Pro His Pro Gin Arg Gly .Cys Glu Val Phe Val Gly Lys lie Pro Arg 
65 70 75' 80 

Asp Val Tyr Glu Asp Glu Leu Val Pro Val Phe Glu Ala Val Gly Arg 
85 90 95 

Thr Tyr Glu Leu Arg Leu Met Met Asp Phe Asp Gly Lys Asn Arg Gly 
100 105 ~ 110 

Tyr Ala Phe Val Met Tyr Cys His Lys His Glu Ala Lys Arg Ala Val 
115 120 125 

Arg Glu Leu Asn Asn Tyr Glu lie Arg pro Gly- Arg Leu Leu Gly Val 
130 135 140 

Cys Cys Ser Val Asp Asn Cys Arg Leu Phe lie Gly Gly lie Pro Lys 
145 150 155 " 160 

Met Lys Lys Arg Glu Glu lie Leu Glu Glu He Ala Lys Val Thr Glu 
165 - 170 175 

Gly Val Leu Asp Val lie Val Tyr Ala Ser Ala Ala Asp Lys Met Lys 
180 185 . 190 

Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala Ala 
195 200 205 

Met Ala Arg Arg Lys Leu Met Pro Gly Arg He Gin Leu Trp Gly His 
210 215 220 

Gin He Ala Val Asp Trp Ala Glu Pro Glu He Asp Val Asp Glu Asp 
225 230 235 240 

Val Met Glu Thr Val Lys He Leu Tyr Val Arg Asn Leu Met He Glu 
245 250 255 

Thr Thr Glu Asp Thr He Lys Lys Ser Phe Gly Gin Phe Asn Pro Gly 
260 265 270 

Cys Val Glu Arg Val Lys Lys lie Arg Asp Tyr Ala Phe Val His Phe 
275 280 285 

Thr Ser Arg Glu Asp Ala Val His Ala Met Asn Asn Leu Asn Gly Thr 
290 295 -300 

Glu Leu Glu Gly Ser Cys Leu Glu Val Thr Leu Ala Lys Pro Val Asp 
305 310 315 320 



Lys Glu Gin Tyr Ser Arg Tyr Gin Lys Ala Ala Arg Gly Gly Gly Ala 
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325 330 335 



Ala Glu Ala Ala Gin Gin Pro Ser Tyr Val Tyr Ser Cys Asp Pro Tyr 
340 345 " 350 



Thr Leu Ala Tyr Tyr Gly Tyr Pro Tyr Asn Ala Leu He Gly Pro Asn 
355 360 365 



Arg Asp Tyr Phe Val Lys Val Ala lie Pro Ala He Gly Ala Gin Tyr 
370 375 380 



Ser Met Phe Pro Ala Ala Pro Ala Pro Lys Met He Glu Asp Gly Lys 
385 390 395 400 



He His Thr Val Glu His Met lie Ser Pro He Ala Val Gin Pro Asp 
405 410 415 



Pro Ala Ser Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
420 425 430 



Ala Val He Pro Thr Val Ser Thr Pro Pro Pro Phe Gin. Gly Arg Pro 
435 440 445 



He Thr Pro Val Tyr Thr Val Ala Pro Asn Val Gin Arg He Pro Thr 
450 455 460 



Ala Gly He Tyr Gly Ala Ser Tyr Val Pro Phe Ala Ala Pro Ala Thr 
465 470 475 480 



Ala Thr He Ala Thr Leu Gin Lys Asn Ala Ala Ala Ala Ala Ala Val 
485 '490 495 



Tyr Gly Gly Tyr Ala Gly Tyr He Pro Gin Ala Phe Pro Ala Ala Ala 
500 505 510 



He Gin Val Pro He Pro Asp Val Tyr Gin Thr Tyr 
515 520 



<210> 43 

<211> 1451 

<212> DNA 

<213> NMJJ13952 PAX 8 
<220> 

<221> CDS 

<222> (161) . . (1354) 

<223> 
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<400> 43 

ttcagaagga ggagagacac cgggcccagg gcaccctcgc gggcgggcgg acccaagcag 60 

tgagggcctg cagccggccg gccagggcag cggcaggcgc ggcccggacc tacgggagga 120 

agccccgagc cctcggcggg ctgcgagcga ctccccggcg atg cct cac aac tec 175 

Met Pro His Asn Ser 
1 5 

ate aga tct ggc cat gga ggg ctg aac cag ctg gga ggg gec ttt gtg 223 
He Arg Ser Gly His Gly Gly Leu Asn Gin Leu Gly Gly Ala Phe Val 
10 15 20 

aat ggc aga cct ctg ccg gaa gtg gtc cgc cag cgc ate gta gac ctg 271 
Asn Gly Arg Pro Leu Pro Glu Val Val Arg Gin Arg He Val Asp Leu 
25 30 35 

gec cac cag ggt gta agg ccc tgc gac ate tct cgc cag etc cgc gtc 319 
Ala His Gin Gly Val Arg Pro Cys Asp He Ser Arg Gin Leu Arg Val 
40 45 50 

age cat ggc tgc gtc age aag ate ctt ggc agg tac tac gag act ggc 367 
Ser His Gly Cys Val Ser Lys He Leu Gly Arg Tyr Tyr Glu Thr Gly 
55 60 65 

age ate egg cct gga gtg ata ggg ggc tec aag ccc aag gtg gee ace 415 
Ser He Arg Pro Gly Val He Gly Gly Ser Lys Pro Lys Val Ala Thr 
70 75 80 85 

ccc aag gtg gtg gag aag att ggg gac tac aaa cgc cag aac cct ace 463 
Pro Lys Val Val Glu Lys He Gly Asp Tyr Lys Arg Gin Asn Pro Thr 
90 95 100 

atg ttt gec tgg gag ate cga gac egg etc ctg get gag ggc gtc tgt 511 
Met Phe Ala Trp Glu He Arg Asp Arg Leu Leu Ala Glu Gly Val Cys 
105 110 115 

gac aat gac act gtg ccc agt gtc age tec att aat aga ate ate egg 559 
Asp Asn Asp Thr Val Pro Ser Val Ser Ser He Asn Arg He lie Arg 
120 125 130 

ace aaa gtg cag caa cca ttc aac etc cct atg gac age tgc gtg gee 607 
Thr Lys Val Gin Gin Pro Phe Asn Leu Pro Met Asp Ser Cys Val Ala 
135 140 145 

ace aag tec ctg agt ccc gga cac acg ctg ate ccc age tea get gta 655 
Thr Lys Ser Leu Ser Pro Gly His Thr Leu He Pro Ser Ser Ala Val 
150 155 160 165 

act ccc ccg gag tea ccc cag teg gat tec ctg ggc tec acc tac tec 703 
Thr Pro Pro Glu Ser Pro Gin Ser Asp Ser Leu .Gly Ser Thr Tyr Ser 
170 175 180 

ate aat ggg etc ctg ggc ate get cag cct ggc age gac aag agg aaa 751 
He Asn Gly Leu Leu Gly He Ala Gin Pro Gly Ser Asp Lys Arg Lys 
185 190 195 

atg gat gac agt gat cag gat age tgc cga eta age att gac tea cag 799 
Met Asp Asp Ser Asp Gin Asp Ser Cys Arg Leu Ser He Asp Ser Gin 
200 205 210 

age age age age gga ccc cga aag cac ctt cgc acg gat gec ttc age 847 
Ser Ser Ser Ser Gly Pro Arg Lys His Leu Arg Thr Asp Ala Phe Ser 
215 220 225 
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cag cac cac etc gag ccg etc gag tgc cca ttt gag egg cag cac tac 895 
Gin His His Leu Glu Pro Leu Glu Cys Pro Phe Glu Arg Gin His Tyr 
230 235 240 " 245 

cca gag gee tat gec tec ccc age cac acc aaa ggc gag cag ggc etc 943 
Pro Glu Ala Tyr Ala Ser Pro Ser His Thr Lys Gly Glu Gin Gly Leu 
250 255 ~ 260 

tac ccg ctg ccc ttg etc aac age acc ctg gac gac ggg aag gec acc 991 
Tyr Pro Leu Pro Leu Leu Asn Ser Thr Leu Asp Asp Gly Lys Ala Thr 
265 270 275 

ctg acc cct tec aac acg cca ctg ggg cgc aac etc teg act cac cag 1039 
Leu Thr Pro Ser Asn Thr Pro Leu Gly Arg Asn Leu Ser Thr His Gin 
280 285 290 

acc tac ccc gtg gtg gca get ccg ccc ttt tgg ate tgc age aag teg 1087 
Thr Tyr Pro Val Val Ala Ala Pro Pro Phe Trp He Cys Ser Lys Ser 
295 300 305 

get ccg ggg tec cgc cct tea atg cct ttc ccc atg ctg cct ccg tgt 1135 
Ala Pro Gly Ser Arg Pro Ser Met Pro Phe Pro Met Leu Pro Pro Cys 
310 315 320 325 

acg ggc agt tea egg gee agg ccc tec. tct cag ggc gag aga tgg tgg 1183 
Thr Gly Ser Ser Arg Ala Arg Pro Ser Ser Gin Gly Glu Arg Trp Trp 
330 335 340 

ggc cca cgc tgc ccg gat acc cac ccc aca tec cca cca gcg gac agg 1231 
Gly Pro Arg Cys Pro Asp Thr His Pro Thr Ser Pro Pro Ala Asp Arg 
345 350 355 

gca get atg cct cct ctg cca teg cag gca tgg tgg cag gaa gtg aat 1279 
Ala Ala Met Pro Pro Leu Pro Ser Gin Ala Trp Trp Gin Glu Val Asn 
360 365 370 

act ctg gca atg cct atg gee aca ccc cct act cct cct aca gcg agg 1327 
Thr Leu Ala Met Pro Met Ala Thr Pro Pro Thr Pro Pro Thr Ala Arg 
375 380 385 

cct ggg get tec cca act cca get tgc tgagttcccc atattattac 1374 
Pro Gly Ala Ser Pro Thr Pro Ala Cys • - 
390 395 

agttccacat caaggecgag tgcaccgccc accactgcca eggectttga ccatctgtag 1434 
ttgccatggg gacagtg 1451 

<210> 44 

<211> 398 

<212> PRT 

<213> NM 013952 PAX 8 



<400> 44 

Met Pro His Asn Ser He Arg Ser Gly His Gly Gly Leu Asn Gin Leu 
1 5 10 15 



Gly Gly Ala Phe Val Asn Gly Arg Pro Leu Pro Glu Val Val Arg Gin 
20 25 30 
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Arg lie Val Asp Leu Ala His Gin Gly Val Arg Pro Cys Asp He Ser 
35 40 45 

Arg Gin Leu Arg Val Ser His Gly Cys Val Ser Lys He Leu Gly Arg 
50 ** 55 60 

Tyr Tyr Glu Thr Gly Ser He Arg Pro Gly Val He Gly Gly Ser Lys 
65 70 .75 80 

Pro Lys Val Ala Thr Pro Lys Val Val Glu Lys. He Gly Asp Tyr Lys 
85 ' 90 95 

Arg Gin Asn Pro Thr Met Phe Ala Trp Glu He Arg Asp Arg Leu Leu 
100 105 110 

Ala Glu Gly Val Cys Asp Asn Asp Thr Val Pro Ser Val Ser Ser He 
115 120 125 

Asn Arg He He Arg Thr Lys Val Gin Gin Pro Phe Asn Leu Pro Met 
130 135 140 

Asp Ser Cys Val Ala Thr Lys Ser Leu Ser Pro Gly His Thr Leu He 
145 150 155 160 

Pro Ser Ser Ala Val Thr Pro Pro Glu Ser Pro Gin Ser Asp Ser Leu 
165 170 175 

Gly Ser Thr Tyr Ser He Asn Gly Leu Leu Gly He Ala Gin Pro Gly 
180 185 190 

Ser Asp Lys Arg Lys Met Asp Asp Ser Asp Gin Asp Ser Cys Arg Leu 
195 200 205 

Ser He Asp Ser Gin Ser Ser Ser Ser Gly Pro Arg Lys His Leu Arg 
210 215 220 

Thr Asp Ala Phe Ser Gin His His Leu Glu Pro Leu Glu Cys Pro Phe 
225 230 235 240 

Glu Arg Gin His Tyr Pro Glu Ala Tyr Ala Ser Pro Ser His Thr Lys 
245 250 255 

Gly Glu Gin Gly Leu Tyr Pro Leu Pro Leu Leu Asn Ser Thr Leu Asp 
260 265 270 

Asp Gly Lys Ala Thr Leu Thr Pro Ser Asn Thr Pro Leu Gly Arg Asn 
275 280 285 



Leu Ser Thr His Gin Thr Tyr Pro Val Val Ala Ala Pro Pro Phe Trp 
290 295 300 
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He Cys Ser Lys Ser Ala Pro Gly Ser Arg Pro Ser Met Pro Phe Pro 
305 310 315 320 

Met Leu Pro Pro Cys Thr Gly -Ser Ser Arg Ala Arg Pro Ser Ser Gin 
325 ' 330 " 335 

Gly Glu Arg Trp Trp Gly Pro Arg Cys Pro Asp Thr His Pro Thr Ser 
340 345 350 

Pro Pro Ala Asp Arg Ala Ala Met Pro Pro Leu Pro Ser Gin Ala Trp 
355 360 365 

Trp Gin Glu Val Asn Thr Leu Ala Met Pro Met Ala Thr Pro Pro Thr 
370 375 380 

Pro Pro Thr Ala Arg Pro Gly Ala Ser Pro Thr Pro Ala Cys 
385 390 395 

<210> 45 

. <211> 326 

<212> DNA 

<213> AI301558 EST 

<400> 45 

• tttctttttt gttacattca tttgattcag tcccttataa accccacacc tcataaacaa 60 

gagattagaa actaaacaaa aaggggggcg gggaaggaaa ttctagagtc gttctggttt 120 

gcaggtggtt gcggtcacaa agagaaatca tcaagaatgt tcacttggca tgtgtgaaag 180 

attcaggggg tctgcagctg tttagtgttg atgcagttgg gtcaaaagag tatcatgtta 240 

gtcttctgtg ggttttaggg agggattatg gagcctccct cccaccccac tggctttctt 300 

gtgtcacagc ctttatttct actccg 326 

<210> 46 

<211> 1534 

<212> DNA 

<213> NMJU8000 FLJ10116 
<220> 

<221> CDS 

<222> (267) .. (938) 

<223> 
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<400> 46 

tttgcgctcg gggaattaaa agaggggaaa aaaagcccga agaaaactca cgccccaaac 60 

aaaacgcaag gagaggaggg cgcgcggcct gcagccctcg cccgcgtccc cggccgcggc 120 

gtgatgcgcg cggaccagcc cgcgacgccc gggctgccgc tgtccccgca cctggacgct 180 

ggcgcggtgg ccgcgcccca gcctcgatcg ctcgccgcgg cgactcggcc ccaggcttcc 240 

ggcgccggtg ggggccctcg ctctcc atg ggg ctg agg gac tgg ctg aga acc 293 

Met Gly Leu Arg Asp Trp Leu Arg Thr 
1 5 

gtg tgc tgc tgc tgc egg tgc gag tgc ttg gag gag cgc gec ctg cct 341 
Val Cys Cys Cys Cys Arg Cys Glu Cys Leu Glu Glu Arg Ala Leu Pro 
10 ~ 15 20 25 

gag aag gag ccc etc gtc agt gat aac aat cca tat tec tea ttt gga 389 
Glu Lys Glu Pro Leu Val Ser Asp Asn Asn Pro Tyr Ser Ser Phe Gly 
30 35 40 

gca act ctg gtg agg gat gat gag aag aat tta tgg agt atg ccc cat 437 
Ala Thr Leu Val Arg Asp Asp Glu Lys Asn Leu Trp Ser Met Pro His 
45 50 55 

gat gtg tec cac aca gag gca gac gac gac aga acc ctg tac aat ttg 485 
Asp Val Ser His Thr Glu Ala Asp Asp Asp Arg Thr Leu Tyr Asn Leu 
60 65 70" 

ata gtc att cgt aat cag cag gec aaa gac tea gag gag tgg cag aag 533 
lie Val lie Arg Asn Gin Gin Ala Lys Asp Ser Glu .Glu Trp Gin Lys 
75 80 85 

etc aac tat gat ate cat acc ctg egg cag gtt cga agg gaa gta aga 581 
Leu Asn Tyr Asp He His Thr Leu Arg Gin Val Arg Arg Glu Val Arg 
90 95 100 105 

aac aga tgg aag tgc ate tta gaa gat tta ggt ttt caa aag gaa get 629 
Asn Arg Trp Lys Cys He Leu Giu Asp Leu Gly Phe Gin Lys Glu Ala 
110 115 120 

gac tct ttg ttg tea gtg act aaa etc age acc ate agt gat tct aaa 677 
Asp Ser Leu Leu Ser Val Thr Lys Leu Ser Thr He Ser Asp Ser Lys 
125 130 135 

aac aca agg aaa get cga gag atg ttg tta aaa ctg get gaa gaa acc 725 
Asn Thr Arg Lys Ala Arg Glu Met Leu Leu Lys Leu Ala Glu Glu Thr 
140 145 150 

agt att ttc cca aca agt tgg gag etc tea gag aga tat etc ttt gtt 773 
Ser He Phe Pro Thr Ser Trp Glu Leu Ser Glu Arg Tyr Leu Phe Val 
155 160 165 

gtg gac cgt etc att gca ctt gat get gca gaa gag ttc ttt aag ctt 821 
Val Asp Arg Leu He Ala Leu Asp Ala Ala Glu Glu Phe Phe Lys Leu 
170 175 180 185 

get cgt cga act tac ccc aag aag cct ggg gtt cca tgc ctg gca gat 869 
Ala Arg Arg Thr Tyr Pro Lys Lys Pro Gly Val Pro Cys Leu Ala Asp 
190 195 200 

ggc cag aaa gaa ctg cac ctg tgg ggg gac etc tea tgc aga ctt gca 917 
Gly Gin Lys Glu Leu His Leu Trp Gly Asp Leu Ser Cys Arg Leu Ala 
205 210 215 

cat atg cag gga gta ttg cac tgaagatctt tgctggacct tcttctcttc 968 
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His Met Gin Gly Val Leu His 
220 

agaagataat tttcaaaagg gagcaatgct gtgaatgcag cttgcttctc tctacagatt 1028 

gagaagtcca gcttcaaaag ttacttgcca cttaagcaag gaacttgtca agagatcatg 1088 

gttcatgtta ctgaaaagac tttaaggatt .tgtaaggtta atccatagat tgctgagaac 1148 

aatggaaata tttttatttt tacagatttt gcacttctga attcaggtta aaaactaact 1208 

tgtatttagt ctgcttagag gactgtgact tgaaaatttt tatataccaa tgagcttttt 1268 

ggtagcgtcc acaatgttta aaatatttca taggcgagat ccgtgttctc catttattaa 1328 

tgcattgtag accaatttaa ctgctgtgtt tcaggaaaat tcttcctagt ttaataagca 1388 

agctaaaagt tttatttttt atatttagtg cttaatcttt gcctcatgtt atgtaaaatt 1448 

agcctgcaga tattttctct caattctgta gactctcgca agataaacat tcaaacagtg 1508 

aaacaaacaa taaaataaat aaacct 1534 

<210> 47 
<211> 224 
<212> PRT 

<213> NM_018000 FLJ10116 
<400> 47 

Met Gly Leu Arg Asp Trp Leu Arg Thr Val Cys Cys Cys Cys Arg Cys 
15 10 15 

Glu Cys Leu Glu Glu Arg Ala Leu Pro Glu Lys Glu Pro Leu Val Ser 
20 25 30 

Asp Asn Asn Pro Tyr Ser Ser Phe Gly Ala Thr Leu Val Arg Asp Asp 
35 40 45 

Glu Lys Asn Leu Trp Ser Met Pro His Asp Val Ser His Thr Glu Ala 
50 55 60 

Asp Asp Asp Arg Thr Leu Tyr Asn Leu He Val He Arg Asn Gin Gin 
65 70 75 80 

Ala Lys Asp Ser Glu Glu Trp Gin Lys Leu Asn Tyr Asp He His Thr 
85 90 95 

Leu Arg Gin Val Arg Arg Glu Val Arg Asn Arg Trp Lys Cys He Leu 
100 105 • 110 

Glu Asp Leu Gly Phe Gin Lys Glu Ala Asp Ser Leu Leu Ser Val Thr 
115 120 125 
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Lys Leu Ser Thr He Ser Asp Ser Lys Asn Thr Arg Lys Ala Arg Glu 
130 135 140 



Met Leu Leu Lys Leu Ala Glu Glu Thr Ser lie Phe Pro Thr Ser Trp 
145 150 155 160 



Glu Leu Ser Glu Arg Tyr Leu Phe Val Val Asp Arg Leu He Ala Leu 
165 170 175 



Asp Ala Ala Glu Glu Phe Phe Lys Leu Ala Arg Arg Thr Tyr Pro Lys 
180 185 190 



Lys Pro Gly Val Pro Cys Leu Ala Asp Gly Gin Lys Glu Leu His Leu 
195 200 205 



Trp Gly Asp Leu Ser Cys Arg Leu Ala His Met Gin Gly Val Leu His 
210 215 220 



<210> 48 

<211> 2385 

<212> DNA 

<213> NM 144724 FLJ30532 



<220> 

<221> CDS 

<222> (71) . . (1441) 

<223> 



<400> 48 

gataccggtt tcagagtcct gggcagcgtg cgcgctcttc ctggcggctg cgcaggtgtg 60 

aaaatcacaa atg tea aat gat gga aga tec agg aat egg gac agg cgc 109 
Met Ser Asn Asp Gly Arg Ser Arg Asn Arg Asp Arg Arg 
1 5 10 

tac gat gag gte'eca age gac ctg ccc tat caa gat acc acc ata aga 157 
Tyr Asp Glu Val Pro Ser Asp Leu Pro Tyr Gin Asp Thr Thr He Arg 
15 20 25 

acc cac cca att ctt cat gac agt gag egg gca gtg age get gat ccc 205 
Thr His Pro He Leu His Asp Ser Glu Arg Ala Val Ser Ala Asp Pro 
30 35 40 45 

ttg cca cca ccc cct etc cca tta cag cca cca ttc ggc cca gac ttc 253 
Leu Pro Pro Pro Pro Leu Pro Leu Gin Pro Pro Phe Gly Pro Asp Phe 
50 55 * 60 

tac tea agt gac aca gaa gaa cca get ata gcg cca gat etc aaa cca 301 
Tyr Ser Ser Asp Thr Glu Glu Pro Ala He Ala Pro Asp Leu Lys Pro 
65 70 75 

gta agg cgc ttt gtc cct gac tec tgg aag aac ttt ttc aga ggg aag 349 
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Val Arg Arg Phe Val Pro Asp Ser Trp Lys Asn Phe Phe Arg Gly Lys 
80 85 90 

aaa aag gac ccc gaa tgg gat aag ccg gtg tct gat ate agg tac ate 397 
Lys Lys Asp Pro Glu Trp Asp Lys Pro Val Ser Asp lie Arg Tyr He 
95 100 105 

tec gat gga gtg gag tgt tea cca cca gee tct cca gca aga cca aac 445 
Ser Asp Gly Val Glu Cys Ser Pro Pro Ala Ser Pro Ala Arg Pro Asn 
110 115 120 125 

cac cgt teg ccc etc aac tec tgc aaa gat ccc tac gga ggg tea gaa 493 
His Arg Ser Pro Leu Asn Ser Cys Lys Asp Pro Tyr Gly Gly Ser Glu 
130 135 140 

gga acc ttt agt tec egg aaa gag get gac gca gtg ttt ccc egg gat 541 
Gly Thr Phe Ser Ser Arg Lys Glu Ala Asp Ala Val Phe Pro Arg Asp 
145 150 155 

ccc tat gga tct eta gac cga cac aca caa aca gtt cga aca tac agt 589 
Pro Tyr Gly Ser Leu Asp Arg His Thr Gin Thr Val Arg Thr Tyr Ser 
160 165 170 

gag aag gtg gag gag tat aac ctg aga tac tec tac atg aag teg tgg 637 
Glu Lys Val Glu Glu Tyr Asn Leu Arg Tyr Ser Tyr Met Lys Ser Trp 
175 180 185 

gca ggc ctg ctg aga ata ctg ggt gtg gtg gag ctg ctt ttg ggg gec 685 
Ala Gly Leu Leu Arg He Leu Gly Val Val Glu Leu Leu Leu Gly Ala 
190 195 200 205 

ggt gtc ttt get tgt gtc aca get tac att cac aag gac agt gag tgg 733 
Gly Val Phe Ala Cys Val Thr Ala Tyr He His Lys Asp Ser Glu Trp 
210 215 220 

tac aac ttg ttt gga tat tea caa ccg tat ggc atg gga ggc gtt ggt 781 
Tyr Asn Leu Phe Gly Tyr Ser Gin Pro Tyr Gly Met Gly Gly Val Gly 
225 230 235 

gga ttg ggc agt atg tat ggg ggc tat tac tac act ggc cct aag acc 829 
Gly Leu Gly Ser Met Tyr Gly Gly Tyr Tyr Tyr Thr Gly Pro Lys Thr 
240 . 245 250 - 

cct ttt gta etc gtg gtt get gga tta get tgg ate acc acc att att 877 
Pro Phe Val Leu Val Val Ala Gly Leu Ala Trp He Thr Thr He lie 
255 260 265 

att ctg gtt ctt ggc atg tec atg tat tac egg acc att ctt ctg gac 925 
He Leu Val Leu Gly Met Ser Met Tyr Tyr Arg Thr He Leu Leu Asp 
270 275 280 . 285 

tct aat tgg tgg ccc eta act gaa ttt gga att aac gtt gec ttg ttt 973 
Ser Asn Trp Trp Pro Leu Thr Glu Phe Gly He Asn Val Ala Leu Phe 
290 295 300 

att ttg tat atg gec gca gec ata gtc tat gtg aat gat acc aac cga 1021 
He Leu Tyr Met Ala Ala Ala He Val Tyr Val Asn Asp Thr Asn Arg 
305 310 315 

ggt ggc etc tgc tac tat ccg tta ttt aat aca cca gtg aat gca gtg 1069 
Gly Gly Leu Cys Tyr Tyr Pro Leu Phe Asn Thr Pro Val Asn Ala Val 
320 325 330 

ttc tgc egg gta gaa gga gga cag ata get gca atg ate ttc ctg ttt 1117 
Phe Cys Arg Val Glu Gly Gly Gin He Ala Ala Met He Phe Leu Phe 
335 340 345 

gtc acc atg ata gtt tat etc att agt get ttg gtt tgc eta aag tta 1165 
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Val Thr Met lie Val Tyr Leu lie Ser Ala Leu Val Cys Leu Lys Leu 
350 355 360 365 

tgg agg cat gag gca get egg aga cat aga gaa tat atg gaa caa cag 1213 
Trp Arg His Glu Ala Ala Arg Arg His Arg Glu Tyr Met Glu Gin Gin 
370 375 380 

gag ata aat gag cca tea ttg tea teg aaa agg aaa atg tgt gaa atg 1261 
Glu He Asn Glu Pro Ser Leu Ser Ser Lys Arg Lys Met Cys Glu Met 
385 390 395 

gec ace agt ggt gac aga caa aga gac tea gaa gtt aat ttc aag gaa 1309 
Ala Thr Ser Gly Asp Arg Gin Arg Asp Ser Glu Val Asn Phe Lys Glu 
400 405 410 

ctg aga aca gca aaa atg aaa cct gaa eta ctg agt gga cac ate ccc 1357 
Leu Arg Thr Ala Lys Met Lys Pro Glu Leu Leu Ser Gly His He Pro 
. 415 420 " 425 

cca cgc cca get aat ttt ttt gta ttt tta gta gag atg ggg ttt cac 1405 
Pro Arg Pro* Ala Asn Phe Phe Val Phe Leu Val Glu Met Gly Phe His 
430 435 440 445 

cgt gtt age cag gat gat etc gat etc ctg ace tea tgatccaccc 1451 
Arg Val Ser Gin Asp Asp Leu Asp Leu Leu Thr Ser 
450 455 

gcctcagcct cccaaagtgt tgggattaca ggcgtgagtc accgcgccca gctggtattg 1511 

cttttctatt ccctttggac atacatgeta cagtcccaca atgtagcatt tccttggaaa 1571 

ctcccttttt tttttttttt gagatggagt ttcgctcttg ttgcccaggc tggagtacag 1631 

tggtatgatc ttggctcact gcagcctctg cctcctgggt teaagegatt ctcctgcctc 1691 

tgcctcccaa gtagctggga ttacaggcac ccaccaccat gcccagctaa ttttttgtat 1751 

ttttagtaga gacaggattt cactatgttg gccaggttgg tctcaagctc ctgacctcag 1811 

atgatctacc agcctcggcc ttctgaagtg ctgggattca ggtgtgagcc actgtgccca 1871 

geagggatge ttcatctttc taagaattat cttggctttg gactttattc ataaatgttt 1931 

tatttctgtt agtatgaaca atagactgee ttaacaaagt ttttttttaa acaaaatcgt 1991 

tcttgttgga ttttattcag cagcatctat catgtagata aattcccagg tgtagcatta 2051 

cagcttctga ctaatatagc tgccattcag acaattaatg ttcaaagagt tttctaaagt 2111 

gataaaacca aagaaaagca tgtggaaaag cagaagctta gaaagttgtg gtcactgaat 2171 

gcactccctg gtttttattt gtcagtgaaa tetttatgea ttcattgtta atattttaat 2231 

tccatggctt tgtaggctgt gctgtgtctg aaggggtaac acctagggaa acatgaggee 2291 

ccttatggga ccccccaaat ggaacaactt cactttctct tttatgtatt gagccctgtg 2351 

ttaacatttc acttaagaag agcaccagtg cttt 2385 

<210> 49 

<211> 457 

<212> PRT 

<213> NM 144724 FLJ30532 
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<400> 49 

Met Ser Asn Asp Gly Arg Ser Arg Asn Arg Asp Arg Arg Tyr Asp Glu 
1 5 10 15 



Val Pro Ser Asp Leu Pro Tyr Gin Asp Thr Thr He Arg Thr His Pro 
20 25 30 



He Leu His Asp Ser Glu Arg Ala Val Ser Ala Asp Pro Leu Pro Pro 
35 40 45 



Pro Pro Leu Pro Leu Gin Pro Pro Phe Gly Pro Asp Phe Tyr Ser Ser 
50 55 60 



Asp Thr Glu Glu Pro Ala He Ala Pro Asp Leu Lys Pro Val Arg Arg 
65 70 • 75 80 



Phe Val Pro Asp Ser Trp Lys Asn Phe Phe Arg Gly Lys Lys Lys Asp 
85 90 ' 95 



Pro Glu Trp Asp Lys Pro Val Ser Asp He Arg Tyr He Ser Asp Gly 
100 105 110 



Val Glu Cys Ser Pro Pro Ala Ser Pro Ala Arg Pro Asn His Arg Ser 
115 120 125 



Pro Leu Asn Ser Cys Lys Asp Pro Tyr Gly Gly Ser Glu Gly Thr Phe 
130 135 140 



Ser Ser Arg Lys Glu Ala Asp Ala Val : Phe Pro Arg Asp Pro Tyr Gly 
145 150 155 ~ • 160 



Ser Leu Asp Arg His Thr Gin Thr Val Arg Thr Tyr Ser Glu Lys Val 
165 170 175 



Glu Glu Tyr Asn Leu Arg Tyr Ser Tyr Met Lys Ser Trp Ala Gly Leu 
180 185 190 



Leu Arg He Leu Gly Val Val Glu Leu Leu Leu Gly Ala Gly Val Phe 
195 200 205 



Ala Cys Val Thr Ala Tyr He His Lys Asp Ser Glu Trp Tyr Asn Leu 
210 215 220 



Phe Gly Tyr Ser Gin Pro Tyr Gly Met Gly Gly Val Gly Gly Leu Gly 
225 230 235 240 



Ser Met Tyr Gly Gly Tyr Tyr Tyr Thr Gly Pro Lys Thr Pro Phe Val 
245 250 255 
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Leu Val Val Ala Gly Leu Ala Trp lie Thr Thr lie lie lie Leu Val 
260 265 270 



Leu Gly Met Ser. Met Tyr' Tyr Arg Thr lie Leu Leu Asp Ser Asn Trp 
275 280 285 



Trp Pro Leu Thr Glu Phe Gly He Asn Val Ala Leu Phe He Leu Tyr 
290 295 300 



Met Ala Ala Ala He Val Tyr Val Asn Asp Thr Asn Arg Gly Gly Leu 
305 310 315 320 



Cys Tyr Tyr Pro Leu Phe Asn Thr Pro Val Asn Ala Val Phe Cys Arg 
325 330 335 



Val Glu Gly Gly Gin He Ala Ala Met He Phe Leu Phe Val Thr Met 
340 345 350 



He Val Tyr Leu He Ser Ala Leu Val Cys Leu Lys Leu Trp Arg His 
355 360 365 



Glu Ala Ala Arg Arg His Arg Glu Tyr Met Glu Gin Gin Glu He Asn 
370 375 380 



Glu Pro Ser Leu Ser Ser Lys Arg Lys Met Cys Glu Met Ala Thr Ser 
385 390 " 395 400 



Gly Asp Arg Gin Arg Asp Ser Glu Val Asn Phe Lys Glu Leu Arg Thr 
405 410 415 



Ala Lys Met Lys Pro Glu Leu Leu Ser Gly His He Pro Pro Arg Pro 
420 425 430 



Ala Asn Phe Phe Val Phe Leu Val Glu Met Gly Phe His Arg Val Ser 
435 440 445 



Gin Asp Asp Leu Asp Leu Leu Thr Ser 
450 455 



<210> 50 

<211> 2280 

<212> DNA 

<213> NM 006424 SLC34A2 



<220> 

<221> CDS 
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<222> <64) . . (2133) 
<223> 



<400> 50 

cgggccaggt ttccaggctc ggccgccgcc tccatcccag cacctgcgga gggagcgctg 60 

acc atg get ccc tgg cct gaa ttg gga gat gec cag ccc aac ccc gat 108 
Met Ala Pro Trp Pro Glu Leu Gly Asp Ala Gin Pro Asn Pro Asp 
15 io 15 

aag tac etc gaa ggg gec gca ggt cag cag ccc act gec cct gat aaa 156 
Lys Tyr Leu Glu Gly Ala Ala Gly Gin Gin Pro Thr Ala Pro Asp Lys 
20 25 30 

age aaa gag acc aac aaa aca gat aac act gag gca cct gta acc aag 204 
Ser Lys Glu Thr Asn Lys Thr Asp Asn Thr Glu Ala Pro Val Thr Lys 
35 40 45 

att gaa ctt ctg ccg tec tac tec acg get aca ctg ata gat gag ccc 252 
lie Glu Leu Leu Pro Ser Tyr Ser Thr Ala Thr Leu lie Asp Glu Pro 
50 55 60 

act gag gtg gat gac ccc tgg aac eta ccc act ctt cag gac teg ggg 300 
Thr Glu Val Asp Asp Pro Trp Asn Leu Pro Thr Leu Gin Asp Ser Gly 
65 70 75 

ate aag tgg tea gag aga gac acc aaa ggg aag att etc tgt ttc ttc 348 
He Lys Trp Ser Glu Arg Asp Thr Lys Gly Lys He Leu Cys Phe Phe 
80 85 -.90 95 

caa ggg att ggg aga ttg att tta ctt etc gga ttt etc tac ttt ttc 396 
Gin Gly He Gly Arg Leu He Leu Leu Leu Gly Phe Leu Tyr Phe Phe 
100 105 ~ 110 

gtg tgc tec ctg gat att ctt agt age gec ttc cag ctg gtt gga gga 444 
Val Cys Ser Leu Asp He Leu Ser Ser Ala Phe Gin Leu Val Gly Gly 
115 120 125 

aaa atg gca gga cag ttc ttc age aac age tct att atg tec aac cct 492 
Lys Met Ala Gly Gin Phe Phe Ser Asn Ser Ser He Met Ser Asn Pro 
130 135 140 

ttg ttg ggg ctg gtg ate ggg gtg ctg gtg acc gtc ttg gtg cag age 540 
Leu Leu Gly Leu Val He Gly Val Leu Val Thr Val Leu Val Gin Ser 
145 150 155 

tec age acc tea acg tec ate gtt gtc age atg gtg tec tct tea ttg 588 
Ser Ser Thr Ser Thr Ser He Val Val Ser Met Val Ser Ser Ser Leu 
160 165 170 175 

etc act gtt egg get gee ate ccc att ate atg ggg gec aac att gga 636 
Leu Thr Val Arg Ala Ala He Pro He He Met Gly Ala Asn He Gly 
180 185 190 

acg tea ate acc aac act att gtt gcg etc atg cag gtg gga gat egg 684 
Thr Ser He Thr Asn Thr He Val Ala Leu Met Gin Val Gly Asp Arg 
195 200 205 

agt gag ttc aga aga get ttt gca gga gec act gtc cat gac ttc ttc 732 
Ser Glu Phe Arg Arg Ala Phe Ala Gly Ala Thr Val His Asp Phe Phe 
210 215 220 

aac tgg ctg tec gtg ttg gtg etc ttg ccc gtg gag gtg gec acc cat 780 
Asn Trp Leu Ser Val Leu Val Leu Leu Pro Val Glu Val Ala Thr His 
225 230 235 
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tac etc gag ate ata acc cag ctt ata gtg gag age ttc cac ttc aag 
Tyr Leu Glu lie lie Thr Gin Leu He Val Glu Ser Phe His Phe Lys 
240 245 250 . . 255 

aat gga gaa gat gee cca gat ctt etg aaa gtc ate act aag ccc ttc 
Asn Gly Glu Asp Ala Pro Asp Leu Leu Lys Val He Thr Lys Pro Phe 
260 265 270 

aca aag etc att gtc cag ctg gat aaa aaa gtt ate age caa att gca 
Thr Lys Leu He Val Gin Leu Asp Lys Lys Val He Ser Gin He Ala 
275 280 285 

atg aac gat gaa aaa gcg aaa aac aag agt ctt gtc aag att tgg tgc 
Met Asn Asp Glu Lys Ala Lys Asn Lys Ser Leu Val Lys He Trp Cys 
290 295 300 



get aac tgc acc tec cct tec etc tgt tgg acg gat ggc. ate caa aac 
Ala Asn Cys Thr Ser Pro Ser Leu Cys Trp Thr Asp Gly He Gin Asn 
320 " 325 330 335 



828 



876 



924 



972 



aaa act ttt acc aac aag acc cag att aac gtc act gtt ccc teg act 1020 
Lys Thr Phe Thr Asn Lys Thr Gin He Asn Val Thr Val Pro Ser Thr 
305 310 315 



1068 



tgg acc atg aag aat gtg acc tac aag gag aac ate gee aaa tgc cag 1116 
Trp Thr Met Lys Asn Val Thr Tyr Lys Glu Asn He Ala Lys Cys Gin 
340 345 350 

cat ate ttt gtg aat ttc cac etc ccg gat ctt get gtg ggc acc ate 1164 
His He Phe Val Asn Phe His Leu Pro Asp Leu Ala Val Gly Thr He 
355 360 365 

ttg etc ata etc tec ctg ctg gtc etc tgt ggt tgc ctg ate atg att 1212 
Leu Leu He Leu Ser Leu Leu Val Leu Cys Gly Cys Leu He Met He 
370 375 380 

gtc aag ate ctg ggc tct gtg etc aag ggg cag gtc gee act gtc ate 1260 
Val Lys He Leu Gly Ser Val Leu Lys Gly Gin Val Ala Thr Val He 
385 390 395 

aag aag acc ate aac act gat ttc ccc ttt ccc ttt gca tgg ttg act 1308 
Lys Lys Thr He Asn Thr Asp Phe Pro Phe Pro Phe Ala Trp Leu Thr 
400 405 410 415 

ggc tac ctg gec ate etc gtc ggg gca ggc atg acc ttc ate gta cag 1356 
Gly Tyr Leu Ala He Leu Val Gly Ala Gly Met Thr Phe He Val Gin 
420 425 430 

age age tct gtg ttc acg teg gee ttg acc ccc ctg att gga ate ggc 1404 
Ser Ser Ser Val Phe Thr Ser Ala Leu Thr Pro Leu He Gly He Gly 
435 440 445 

gtg ata acc att gag agg get tat cca etc acg ctg ggc tec aac ate 
Val lie Thr lie Glu Arg Ala Tyr Pro Leu Thr Leu Gly Ser Asn lie 
450 455 460 

ggc acc acc acc acc gee ate ctg gee gee tta gee age cct ggc aat 
Gly Thr Thr Thr Thr Ala lie Leu Ala Ala Leu Ala Ser Pro Gly Asn 
465 470 475 

gca ttg agg agt tea etc cag ate gee ctg tgc cac ttt ttc ttc aac 
Ala Leu Arg Ser Ser Leu Gin He Ala Leu Cys His Phe Phe Phe Asn 
480 485 490 495 

ate tec ggc ate ttg ctg tgg tac ccg ate ccg ttc act cgc ctg ccc 1596 
He Ser Gly lie Leu Leu Trp Tyr Pro He Pro Phe Thr Arg Leu Pro 
500 505 510 



1452 



1500 



1548 
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ate cgc atg gee aag ggg ctg ggc aac ate tct gec aag tat cgc tgg 1644 
lie Arg Met Ala Lys Gly Leu Gly Asn lie Ser Ala Lys Tyr Arg Trp 
515 520 . 525 

ttc gec gtc ttc tac ctg ate ate ttc ttc ttc ctg ate ccg ctg acg 1692 
Phe' Ala Val Phe Tyr Leu lie lie Phe Phe Phe Leu lie Pro Leu Thr 
530 535 540 

gtg ttt ggc etc teg ctg gee ggc tgg egg gtg ctg gtt ggt gtc ggg 1740 
Val Phe Gly Leu Ser Leu Ala Gly Trp Arg. Val Leu Val Gly Val Gly 
545 550 555 

gtt ccc gtc gtc ttc ate ate ate ctg gta ctg tgc etc cga etc ctg 1788 
Val Pro Val Val Phe lie lie lie Leu Val Leu Cys Leu Arg Leu Leu 
560 565 570 . 575 

cag tct cgc tgc cca cgc gtc ctg ccg aag aaa etc cag aac tgg aac 1836 
Gin Ser Arg Cys Pro Arg Val Leu Pro Lys Lys Leu Gin Asn Trp Asn 
580 585 590 

ttc ctg ccg ctg tgg atg cgc teg ctg aag ccc tgg gat gee gtc gtc 1884 
Phe Leu Pro Leu Trp Met Arg Ser Leu Lys Pro Trp Asp Ala Val Val 
595 600 605 

tec aag ttc ace ggc tgc ttc cag atg cgc tgc tgc tac tgc tgc cgc 1932 
Ser Lys Phe Thr Gly Cys Phe Gin Met Arg Cys Cys Tyr Cys Cys Arg 
610 615 620 

gtg tgc tgc cgc gcg tgc tgc ttg ctg tgt ggc tgc ccc aag tgc. tgc 1980 
Val Cys Cys Arg Ala Cys Cys Leu Leu Cys Gly Cys Pro Lys Cys Cys 
625 630 635 

cgc tgc age aag tgc tgc gag gac ttg gag gag gcg cag gag ggg cag 2028 
Arg Cys Ser Lys Cys Cys Glu Asp Leu Glu Glu Ala Gin Glu Gly Gin 
640 645 650 655 

gat gtc cct gtc aag get cct gag acc ttt gat aac ata acc att age 2076 
Asp Val Pro Val Lys Ala Pro Glu Thr Phe Asp Asn He Thr He Ser 
660 665 670 

aga gag get cag ggt gag gtc cct gee teg gac tea aag acc gaa tgc 2124 
Arg Glu Ala Gin Gly Glu Val Pro Ala Ser Asp Ser Lys Thr Glu Cys 
675 680 685 

acg gec ttg taggggaege cccagattgt cagggatggg gggatggtcc 2173 
Thr Ala Leu 
690 

ttgagttttg catgctctcc tccctcccac ttctgcaccc tttcaccacc tcgaggagat 2233 
ttgctcccca ttagcgaatg aaattgatgc agtcctaaaa aaaaaaa 2280 



<210> 51 

<211> 690 

<212> PRT 

<213> NM 006424 SLC34A2 



<400> 51 

Met Ala Pro Trp Pro Glu Leu Gly Asp Ala Gin Pro Asn Pro Asp Lys 
15 10 15 
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Tyr Leu Glu Gly Ala Ala Gly Gin Gin Pro Thr Ala Pro Asp Lys Ser 
20 25 30 



Lys Glu Thr Asn Lys Thr Asp Asn Thr Glu Ala Pro Val Thr Lys lie 
35 40 45 



Glu Leu Leu Pro Ser Tyr Ser Thr Ala Thr Leu lie Asp Glu Pro Thr 

50 55 . 60 



Glu Val Asp Asp Pro Trp Asn Leu Pro Thr Leu Gin Asp Ser Gly lie 
65 " 70 75 80 



Lys Trp Ser Glu Arg Asp Thr Lys Gly Lys He Leu Cys Phe Phe Gin 
85 90 95 



Gly He Gly Arg Leu He Leu Leu Leu Gly Phe Leu Tyr Phe Phe Val 
100 105 110 



Cys Ser Leu Asp He Leu Ser Ser Ala Phe Gin Leu Val Gly Gly Lys 
115 120 125 



Met Ala Gly Gin Phe Phe Ser Asn Ser Ser He Met Ser Asn Pro Leu 
130 135 140 



Leu Gly Leu Val He Gly Val Leu Val Thr Val Leu Val Gin Ser Ser 
145 150 " 155 160 



Ser Thr Ser Thr Ser He Val Val Ser Met Val Ser Ser Ser Leu Leu 
. 165 170 175 



Thr Val Arg Ala Ala He Pro He He Met Gly Ala Asn He Gly Thr 
180 185 190 



Ser He Thr Asn Thr He Val Ala Leu Met Gin Val Gly Asp Arg Ser 
195 200 205 



Glu Phe Arg Arg Ala Phe Ala Gly Ala Thr Val His Asp Phe Phe Asn 
210 215 220 



Trp Leu Ser Val Leu Val Leu Leu Pro Val Glu Vai Ala Thr His Tyr 
225 230 235 240 



Leu Glu He He Thr Gin Leu He Val Glu Ser Phe His Phe Lys Asn 
245 250 255 



Gly Glu Asp Ala Pro Asp Leu Leu Lys Val He Thr Lys Pro Phe Thr 
260 265 270 



Lys Leu He Val Gin Leu Asp Lys Lys Val He Ser Gin He Ala Met 
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275 280 285 

Asn Asp Glu Lys Ala Lys Asn Lys Ser Leu Val Lys lie Trp Cys Lys 
290 295 300 

Thr Phe Thr Asn Lys Thr Gin lie Asn Val Thr Val Pro Ser Thr Ala 
305 310 315 320 

Asn Cys Thr Ser Pro Ser Leu Cys Trp Thr Asp Gly He Gin Asn Trp 
325 330 335 

Thr Met Lys Asn Val Thr Tyr Lys Glu Asn He Ala Lys Cys Gin His 
340 345 350 

He Phe Val Asn Phe His Leu Pro Asp Leu Ala Val Gly Thr He Leu 
355 360 365 

Leu He Leu Ser Leu Leu Val Leu Cys Gly Cys Leu He MetT He Val 
370 375 380 

Lys He Leu Gly Ser Val Leu Lys Gly Gin Val Ala Thr Val He Lys 
385 390 395 400 

Lys Thr He Asn Thr Asp Phe Pro Phe . Pro Phe Ala Trp Leu Thr Gly 
405 410 415 

Tyr Leu Ala He Leu Val Gly Ala Gly Met Thr Phe He Val Gin Ser 
420 425 430 

Ser Ser Val Phe Thr Ser Ala Leu Thr Pro Leu He Gly He Gly Val 
435 440 • 445 

He Thr He Glu Arg Ala Tyr Pro Leu Thr Leu Gly Ser Asn He Gly 
450 455 460 

Thr Thr Thr Thr Ala He Leu Ala Ala Leu Ala Ser Pro Gly Asn Ala 
465 470 475 480 

Leu Arg Ser Ser Leu Gin He Ala Leu Cys His Phe Phe Phe Asn He 
485 490 495 

Ser Gly He Leu Leu Trp Tyr Pro He Pro Phe Thr Arg Leu Pro He 
500 505 510 

Arg Met Ala Lys Gly Leu Gly Asn He Ser Ala Lys Tyr Arg Trp Phe 
515 520 525 

Ala Val Phe Tyr Leu He He Phe Phe Phe Leu He Pro Leu Thr Val 
530 535 540 



Phe Gly Leu Ser Leu Ala Gly Trp Arg Val Leu Val Gly Val Gly Val 
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545 550 555 560 



Pro Val Val Phe He He He Leu Val Leu Cys Leu Arg Leu Leu Gin 
565 570 575 



Ser Arg Cys Pro Arg Val Leu Pro Lys Lys Leu Gin Asn Trp Asn Phe 
580 585 ■ 590 



Leu Pro Leu Trp Met Arg Ser Leu Lys Pro Trp Asp Ala Val Val Ser 
595 600 605 



. Lys Phe Thr Gly Cys Phe Gin Met Arg Cys Cys Tyr Cys Cys Arg Val 
610 615 * 620 



Cys Cys Arg Ala Cys Cys Leu Leu Cys Gly Cys Pro Lys Cys Cys Arg 
625 630 ~ 635 640 



Cys Ser Lys Cys Cys Glu Asp Leu Glu Glu Ala- Gin Glu Gly Gin Asp 
645 650 655 



Val Pro Val Lys Ala Pro Glu Thr Phe Asp Asn He Thr He Ser Arg 
660 665 670 



Glu Ala Gin Gly Glu Val Pro Ala Ser Asp Ser Lys Thr Glu Cys Thr 
675 680 685 



Ala Leu 
690 



<210> 52 

<211> 529 

<212> DNA 

<213> AW959311, DKFZp434J037, EST 
<220> 

<221> misc_feature 

<222> (393).. (393) 

<223> the residue at position 393 is A, T C or G 



<400> 52 






ctgtttcttc 


aatggttctc ttcccttttc catcctccaa acctggcctg agcctcctga 


60 


agttgctgct 


gtgaatctga aagacttgaa aagcctccac ctgctgtgtg gacttcatct 


120 


caaggggccc 


agcctcctct ggactccacc ttggacctca gtgactcaga acttctgcct 


180 


ctaagctgct 


ctaaagtcca gactatggat gtgttctcta ggccttcagg actctagaat 


240 
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gtccatattt atttttatgt tcttggcttt gtgttttagg aaaagtgaat cttgctgttt 300 

tcaataatgt gaatgctatg ttctgggaaa atccactatg acatctaagt tttgtgtaca 360 

gagagatatt tttgcaacta tttccacctt ctnccacaac cccccacact ccactccaca 420 

ctcttgagtc tctttaccta atggtctcta cctaatggac cctcgtggcc aaaaagtcca 480 

ttaaaccaga aaggtgattg gaaaaaaaaa aaaaaaaact cgagggggg 529 

<210> 53 

<211> 2100 

<212> DNA 

<213> AF111713 JAM1, junctional adhesion molecule 1 
<220> 

<221>. CDS 

<222> <287) . . (1183) 

<223> 



<400> 53 

cgaggccgag agggcgggcc agggccgcac tccggagact cgcggttgct acgcgcacca 60 

tggctggaga cggagtttcg ctcttgatgc ccagcaggct ggagtgcgat ggcgcgattt 120 

cggctcactg caacctcctc ctcccagagg tacttctcag ccctctagct ccaactgaga 180 

acccagccag tcaggaagtc gctacttcgg gaacaccaac caatcagggg gccgtcacct 240 

gctgaaggag tccttcggcg gctgttgtgt cgggagcctg atcgcg atg ggg aca 295 

Met Gly Thr 
1 

aag gcg caa gtc gag agg aaa ctg ttg tgc etc ttc ata ttg gcg ate 343 
Lys Ala Gin Val Glu Arg Lys Leu Leu Cys Leu Phe He Leu Ala He 
5 10 15 

ctg ttg tgc tec ctg gca ttg ggc agt gtt aca gtg cac tct tct gaa 391 
Leu Leu Cys Ser Leu Ala Leu Gly Ser Val Thr Val His Ser Ser Glu 
20 25 30 35 

cct gaa gtc aga att cct gag aat aat cct gtg aag ttg tec tgt gee 439 
Pro Glu Val Arg He Pro Glu Asn Asn Pro Val Lys Leu Ser Cys Ala 
40 45 " 50 

tac teg ggc ttt tct tct ccc cgt gtg gag tgg aag ttt gac caa gga 487 
Tyr Ser Gly Phe Ser Ser Pro Arg Val Glu Trp Lys Phe Asp Gin Gly 
55 60 ** 65 

gac acc ace aga etc gtt tgc tat aat aac aag ate aca get tec tat 535 
Asp Thr Thr Arg Leu Val Cys Tyr Asn Asn Lys He Thr Ala Ser Tyr 
70 75 80 

gag gac egg gtg acc ttc ttg cca act ggt ate acc ttc aag tec gtg 583 
Glu Asp Arg Val Thr Phe Leu Pro Thr Gly He Thr Phe Lys Ser Val 
85 90 95 



i 
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aca egg gaa gac act ggg aca tac act tgt atg gtc tct gag gaa ggc 631 
Thr Arg Glu Asp Thr Gly Thr Tyr Thr Cys Met Val Ser Glu Glu Gly 
100 105 110 115 

ggc aac age tat ggg gag gtc aag gtc aag etc ate gtg ctt gtg cct 679 
Gly Asn Ser Tyr Gly Glu Val Lys Val Lys Leu He Val Leu Val Pro 
120 125 130 

cca tec aag cct aca gtt aac ate ccc tec tct gee acc att ggg aac 727 
Pro Ser Lys Pro Thr Val Asn He Pro Ser Ser Ala Thr He Gly Asn 
135 140 145 

egg gca gtg ctg aca tgc tea gaa caa gat ggt tec cca cct tct gaa 775 
Arg Ala Val Leu Thr Cys Ser Glu Gin Asp Gly Ser Pro Pro Ser Glu 
150 155 160 

tac acc tgg ttc aaa gat ggg ata gtg atg cct acg aat ccc aaa age 823 
Tyr Thr Trp Phe Lys Asp Gly He Val Met Pro Thr Asn Pro Lys Ser 
165 170 175 

acc cgt gee ttc age aac tct tec tat gtc ctg aat ccc aca aca gga 871 
Thr Arg Ala Phe Ser Asn Ser Ser Tyr Val Leu Asn Pro Thr Thr Gly 
180 185 190 195 

gag ctg gtc ttt gat ccc ctg tea gec tct gat act gga gaa tac age 919 
Glu Leu Val Phe Asp Pro Leu Ser Ala Ser Asp Thr Gly Glu Tyr Ser 
200 205 210 

tgt gag gca egg aat ggg tat ggg aca .ccc atg act tea aat get gtg 967 
Cys Glu Ala Arg Asn Gly. Tyr Gly Thr Pro Met Thr Ser Asn Ala Val 
215 220 225 

cgc atg gaa get gtg gag egg aat gtg ggg gtc ate gtg gca gee gtc 1015 
Arg Met Glu Ala Val Glu Arg Asn Val Gly Val He Val Ala Ala Val 
230 235 240 

ctt gta acc ctg att etc ctg gga ate ttg gtt ttt ggc ate tgg ttt 1063 
Leu Val Thr Leu lie Leu Leu Gly He Leu Val Phe Gly He Trp Phe 
245 250 255" 

gec tat age cga ggc cac ttt gac aga aca aag aaa ggg act teg agt 1111 
Ala Tyr Ser Arg Gly His Phe Asp Arg Thr Lys Lys Gly Thr Ser Ser . 
260 265 " 270 275 

aag aag gtg att tac age cag cct agt gec cga agt gaa gga gaa ttc 1159 
Lys Lys Val He Tyr Ser Gin Pro Ser Ala Arg Ser Glu Gly Glu Phe 
280 285 290 

aaa cag acc teg tea ttc ctg gtg tgagcctggt cggctcaccg cctatcatct 1213 
Lys Gin Thr Ser Ser Phe Leu Val 
295 



geatttgect 


tactcaggtg 


ctactggact 


ctggcccctg 


atgtctgtag 


tttcacagga 


1273 


tgccttattt 


gtcttctaca 


ccccacaggg 


ccccctactt 


etteggatgt 


gtttttaata 


1333 


atgtcagcta 


tgtgccccat 


cctccttcat 


gccctccctc 


cctttcctac 


cactgetgag 


1393 


tggcctggaa 


cttgtttaaa 


gtgtttattc 


cccatttctt 


tgagggatca 


ggaaggaatc 


1453 


ctgggtatgc 


cattgacttc 


ccttctaagt 


agacagcaaa 


aatggcgggg 


gtcgcaggaa 


1513 


tctgcactca 


actgcccacc 


tggctggcag 


ggatctttga 


ataggtatct 


tgagcttggt 


1573 


tctgggctct 


ttccttgtgt 


actgacgacc 


agggecaget 


gttctagagt 


gggaattaga 


1633 


ggctagagcg 


gctgaaatgg 


ttgtttggtg 


atgacactgg 


ggtccttcca 


tctctggggc 


1693 
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ccactctctt 


ctgtcttccc atgggaagtg 


ccactgggat 


ccctctgccc tgtcctcctg 


1753 


aatacaagct 


gactgacatt gactgtgtct 


gtggaaaatg 


ggagctcttg ttgtggagag 


1813 


catagtaaat 


tttcagagaa cttgaagcga 


aaaggattta 


aaaccgctgc tctaaagaaa 


1873 


agaaaactgg 


aggctgggcg cagtggctca 


cgcctgtaat 


cccagaggct gaggcaggcg 


1933 


gatcacctga 


ggtcgggagt tcgggatcag 


cctgaccaac 


atggagaaac cctgctggaa 


1993 


atacagagtt 


agccaggcat ggtggtgcat 


gcctgtagtc 


ccagctgctc aggagcctgg 


2053 


caacaagagc 


aaaactccag ctcaaaaaaa 


aaaaaaaaaa 


aaaaaaa 


2100 



<210> 54 

<211> 299 

<212> PRT 

<213> AF111713 JAM1, junctional adhesion molecule 1 

<400> 54 

Met Gly Thr Lys Ala Gin Val Glu Arg Lys Leu Leu Cys Leu Phe He 
1 5 10 15 

Leu Ala He Leu Leu Cys Ser Leu Ala Leu Gly Ser Val Thr Val His 
20 25 30 

Ser Ser Glu Pro Glu Val Arg He Pro Glu Asn Asn Pro Val Lys Leu 
35 40 45 

Ser Cys Ala Tyr Ser Gly Phe Ser Ser. Pro Arg Val Glu Trp Lys Phe 
50 55 60 

Asp Gin Gly Asp Thr Thr Arg Leu Val Cys Tyr Asn Asn Lys He Thr 
65 70 .75 80 

Ala Ser Tyr Glu Asp Arg Val Thr Phe Leu Pro Thr Gly He Thr Phe 
85 90 95 

Lys Ser Val Thr Arg Glu Asp Thr Gly Thr Tyr Thr Cys Met Val Ser 
100 105 110 

Glu Glu Gly Gly Asn Ser Tyr Gly Glu Val Lys Val Lys Leu He Val 
115 120 125 

Leu Val Pro Pro Ser Lys Pro Thr Val Asn He Pro Ser Ser Ala Thr 
130 135 140 

He Gly Asn Arg Ala Val Leu Thr Cys Ser Glu Gin Asp Gly Ser Pro 
145 150 155 160 
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Pro Ser Glu Tyr Thr Trp Phe Lys Asp Gly He Val Met Pro Thr Asn 
165 170 175 



Pro Lys Ser Thr Arg Ala Phe Ser Asn Ser Ser Tyr Val Leu Asn Pro 
180 185 190 



Thr Thr Gly Glu Leu Val Phe Asp Pro Leu Ser Ala .Ser Asp Thr Gly 
195 200 205 



Glu Tyr Ser Cys Glu Ala Arg Asn Gly Tyr Gly Thr Pro Met Thr Ser 
210 215 220 



Asn Ala Val Arg Met Glu Ala Val Glu Arg Asn Val Gly Val He Val 
225 230 235 240 



Ala Ala Val Leu Val Thr Leu He Leu Leu Gly He Leu Val Phe Gly 
245 250 255 



He Trp Phe Ala Tyr Ser Arg Gly His Phe Asp Arg Thr Lys Lys Gly 
260 265 270 



Thr Ser Ser Lys Lys Val He Tyr Ser Gin Pro Ser Ala Arg Ser Glu 
275 280 285 



Gly Glu Phe Lys Gin Thr Ser Ser Phe Leu Val 
290 295 



<210> 55 

<211> 2154 

<212> DNA 

<213> NMJ)06636 MTHFD2, methylene tetrahydrof olate 
<220> 

<221> CDS 

<222> (77) . . (1108) 

<223> 



<400> 55 

atataaccgc gtggcccgcg cgcgcgcttc cctcccggcg cagtcaccgg cgcggtctat 60 

ggctgcgact tctcta atg tct get ttg get gee egg ctg ctg cag ccc gcg 112 
Met Ser Ala Leu Ala Ala Arg Leu Leu Gin Pro Ala 
1 5 10 

cac age tgc tec ctt cgc ctt cgc cct ttc cac etc gcg gca gtt cga 160 
His Ser Cys Ser Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg 
15 20 25 
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aat gaa get gtt gtc att tct gga agg aaa ctg gee cag cag ate aag 
Asn Glu Ala Val Val He Ser Gly Arg Lys Leu Ala Gin Gin lie Lys 
30 35 40 



get gtt tct cca gac aag gat gtt gat ggc ttt cat gta att aat gta 
Ala Val Ser Pro Asp Lys Asp Val Asp Gly Phe His Val lie Asn Val 
145 150 155 



act gtt aca ata tct cat cga tat act ccc aaa gag cag ttg aag aaa 
Thr Val Thr He Ser His Arg Tyr Thr Pro Lys Glu Gin Leu Lys Lys 
225 230 235 



ctg ate aca gca gat atg ate aag gaa gga gca gca gtc att gat gtg 

Leu He Thr Ala Asp Met He Lys Glu Gly Ala Ala Val He Asp Val 

255 260 265 

gga ata aat aga gtt cac gat cct gta act gee aaa ccc aag ttg gtt 

Gly lie Asn Arg Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val 

270 275 280 

gga gat gtg gat ttt gaa gga gtc aga caa aaa get ggg tat ate act 
Gly Asp Val Asp Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr He Thr 

285 290 295 300 



208 



cag gaa gtg egg cag gag gta gaa gag tgg gtg gee tea ggc aac aaa 256 
Gin Glu Val Arg Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys 
45 50 55 60 

egg cca cac ctg agt gtg ate ctg gtt ggc gag aat cct gca agt cac 304 
Arg Pro His Leu Ser Val He Leu Val Gly Glu Asn Pro Ala Ser His 
65 70 . 75 

tec tat gtc etc aac aaa ace agg gca get gca gtt gtg gga ate aac 352 
Ser Tyr Val Leu Asn Lys Thr Arg Ala Ala Ala Val Val Gly He Asn 
80 85 90 

agt gag aca att atg aaa cca get tea att tea gag gaa gaa ttg ttg 
Ser Glu Thr He Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu 
95 100 . \ 105 

aat tta ate aat aaa ctg aat aat gat gat aat gta gat ggc etc ctt 448 
Asn Leu He Asn Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu 
110 115 120 

gtt cag ttg cct ctt cca gag cat att gat gag aga agg ate tgc aat 496 
Val Gin Leu Pro Leu Pro Glu His He Asp Glu Arg Arg He Cys Asn 
125 130 135 140 



400 



544 



gga cga atg tgt ttg gat cag tat tec atg tta ccg get act cca tgg 592 
Gly Arg Met Cys Leu Asp Gin Tyr Ser Met Leu Pro Ala Thr Pro Trp 
160 165 170 

ggt gtg tgg gaa ata ate aag cga act ggc att cca acc eta ggg aag 640 
Gly Val Trp Glu He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys 
175 180 185 

aat gtg gtt gtg get gga agg tea aaa aac gtt gga atg ccc att gca 688 
Asn Val Val Val Ala Gly Arg Ser Lys Asn Val Gly Met Pro He Ala 
190 195 200 

atg tta ctg cac aca gat ggg gcg cat gaa cgt ccc gga ggt gat gee 736 
Met- Leu Leu His Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala 
205 210 215 220 



784 



cat aca att ctt gca gat att gta ata tct get gca ggt att cca aat 832 
His Thr He Leu Ala Asp He Val He Ser Ala Ala Gly He Pro Asn 
240 245 250 



880 



928 



976 
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cca gtt cct gga ggt gtt ggc ccc atg aca gtg gca- atg eta atg aag 1024 
Pro Val Pro Gly Gly Val Gly Pro Met Thr Val Ala Met Leu Met Lys 
305 310 315 

aat acc att att get gca aaa aag gtg ctg agg ctt gaa gag cga gaa 1072 
Asn Thr He He Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu 
320 325 330 

gtg ctg aag tct aaa gag ctt ggg gta gec act aat taactactgt 1118 
Val Leu Lys Ser Lys Glu Leu Gly Val Ala Thr Asn 
335 340 



gtcttctgtg 


tcacaaacag 


cactccaggc 


cagctcaaga 


agcaaagcag 


gecaatagaa 


1178 


atgeaatatt 


tttaatttat 


tctactgaaa 


tggtttaaaa 


tgatgccttg 


tatttattga 


1238 


aagcttaaat 


gggtgggtgt 


ttctgeacat 


acctctgcag 


tacctcacca 


gggagcattc 


1298 


cagtatcatg 


cagggtcctg 


tgatctagee 


aggagcagee 


attaacctag 


tgattaatat 


1358 


gggagacatt 


accatatgga 


ggatggatgc 


ttcactttgt 


caagcacctc 


agttacacat 


1418 


tcgccttttc 


taggattgea 


tttcccaagt 


getattgeaa 


taacagttga 


tactcatttt 


1478 


aggtaccaga 


ccttttgagt 


tcaactgatc 


aaaccaaagg 


aaaagtgttg 


ctagagaaaa 


1538 


ttggggaaaa 


ggtgaaaaag 


aaaaaatggt 


agtaattgag 


cagaaaaaaa 


ttaatttata 


1598 


tatgtattga 


ttggcaacca 


gatttatcta 


agtagaactg 


aattggctag 


gaaaaaagaa 


1658 


aaactgcatg 


ttaatcattt 


tcctaagctg 


tccttttgag 


gcttagtcag 


tttattggga 


1718 


aaatgtttag 


gattattcct 


tgctattagt 


actcatttta 


tgtatgttac 


ccttcagtaa 


1778 


gttctcccca 


ttttagtttt 


ctaggactga 


aaggattctt 


ttctacatta 


tacatgtgtg 


1838 


ttgtcatatt 


tggcttttgc 


tatatacttt 


aacttcattg 


ttaaattttt 


gtattgtata 


1898 


gtttctttgg 


tgtatcttaa 


aacctatttt 


tgaaaaacaa 


acttggcttg 


ataatcattt 


1958 


gggcagcttg 


ggtaagtacg 


caacttactt 


ttccaccaaa 


gaactgtcag 


cagctgcctg 


2018 


cttttctgtg 


atgtatgtat 


cctgttgact 


tttccagaaa 


ttttttaaga 


gtttgagtta 


2078 


ctattgaatt 


taatcagact 


ttctgattaa 


agggttttct 


ttctttttta 


ataaaacaca 


2138 


tctgtctggt 


at ggt a 










2154 



<210> 56 

<211> 344 

<212> PRT 

<213> NM_006636 MTKFD2, methylene tetrahydrof olate 



<400> 56 

Met Ser Ala Leu Ala Ala Arg Leu 
1 5 



Leu Gin Pro Ala His Ser Cys Ser 
10 .15 



Leu Arg Leu Arg Pro Phe His Leu Ala Ala Val Arg Asn Glu Ala Val 
20 25 30 
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Val lie Ser Gly Arg Lys Leu Ala Gin Gin lie Lys Gin Glu Val Arg 
35 40 45 



Gin Glu Val Glu Glu Trp Val Ala Ser Gly Asn Lys Arg Pro His Leu 
50 55 60 



Ser Val lie Leu Val Gly Glu Asn Pro Ala Ser His Ser Tyr Val Leu 
65 70 75 80 



Asn Lys Thr Arg Ala Ala Ala Val Val Gly lie . Asn Ser Glu Thr He 
85 90 95 



Met Lys Pro Ala Ser He Ser Glu Glu Glu Leu Leu Asn Leu He Asn 
100 105 110 



Lys Leu Asn Asn Asp Asp Asn Val Asp Gly Leu Leu Val Gin Leu Pro 
115 120 125 



Leu Pro Glu His He Asp Glu Arg Arg He Cys Asn Ala Val Ser Pro 
130 135 140 



Asp Lys Asp Val Asp Gly Phe His Val He Asn Val Gly Arg Met Cys 
145 " 150 155 160 



Leu Asp Gin Tyr Ser Met Leu Pro Ala- Thr Pro Trp Gly Val Trp Glu 
165 170 175 



He He Lys Arg Thr Gly He Pro Thr Leu Gly Lys Asn Val Val Val 
180 185 * 190 



Ala Gly Arg Ser Lys Asn Val Gly Met Pro He Ala Met Leu Leu His 
195 200 205 



Thr Asp Gly Ala His Glu Arg Pro Gly Gly Asp Ala Thr Val Thr He 
210 215 220 



Ser His Arg Tyr Thr Pro Lys Glu Gin .Leu Lys Lys His Thr He Leu 
225 230 235 240 



Ala Asp lie Val He Ser Ala Ala Gly He Pro Asn Leu He Thr Ala 
245 250 255 



Asp Met He Lys Glu Gly Ala Ala Val He Asp Val Gly He Asn Arg 
260 265 270 



Val His Asp Pro Val Thr Ala Lys Pro Lys Leu Val Gly Asp Val Asp 
275 280 285 



Phe Glu Gly Val Arg Gin Lys Ala Gly Tyr He Thr Pro Val Pro Gly 
290 295 300 
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Gly Val Gly Pro Met Thr Val' Ala Met Leu Met Lys Asn Thr lie lie 
305 310 315 320 



Ala Ala Lys Lys Val Leu Arg Leu Glu Glu Arg Glu Val Leu Lys Ser 
325 330 335 



Lys Glu Leu Gly Val Ala Thr Asn 
340 



<210> 57 

<211> 1117 

<212> DNA 

<213> NM_006149 galectin 4, L GALS 4 



<220> 

<221> CDS 

<222> (57) . . (1025) 

<223> 



<400> 57 

atctcccact cctgcagctc ttctcacagg accagccact agcgcagcct cgagcg atg 59 

Met 
1 

gcc tat gtc ccc gca ccg ggc tac cag ccc acc tac aac ccg acg ctg 107 
Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr Leu 
5 10 15 

cct tac tac cag ccc ate ccg ggc ggg etc aac gtg gga atg tct gtt 155 
Pro Tyr Tyr Gin Pro lie Pro Gly Gly Leu Asn Val Gly Met Ser Val 
20 25 30 

tac ate caa gga gtg gcc age gag cac atg aag egg ttc ttc gtg aac 203 
Tyr He Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val Asn 
35 40 45 

ttt gtg gtt ggg cag gat ccg ggc tea gac gtc gcc ttc cac ttc aat 251 
Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val Ala Phe His Phe Asn 
50 55 60 65 

ccg egg ttt gac ggc tgg gac aag gtg gtc ttc aac acg ttg cag ggc 299 
Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin Gly 
70 * 75 80 

ggg aag tgg ggc age gag gag agg aag agg age atg ccc ttc aaa aag 347 
Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys Lys 
85 90 95 

ggt gcc gcc ttt gag ctg gtc ttc ata gtc. ctg get gag cac tac aag 395 
Gly Ala Ala Phe Glu Leu Val Phe He Val Leu Ala Glu His Tyr Lys 
100 105 110 

gtg gtg gta aat gga aat ccc ttc tat gag tac ggg cac egg ctt ccc 443 
Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu Pro 
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115 120 ' 125 

eta cag atg gtc acc cac ctg caa gtg gat ggg gat ctg caa ctt caa 491 
Leu Gin Met Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu Gin 
130 135 140 145 

tea ate aac ttc ate gga ggc cag ccc etc egg ccc cag gga ccc ccg 539 
Ser He Asn Phe He Gly Gly Gin Pro Leu Arg Pro Gin Gly Pro Pro 
150 155 160 

atg atg cca cct tac cct ggt ccc gga cat tgc cat caa cag ctg aac 
Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu Asn 
165 170 175 

age ctg ccc acc atg gaa gga ccc cca acc ttc aac ccg cct gtg cca 
Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val Pro 
180 185 190 

tat ttc ggg agg ctg caa gga ggg etc aca get cga aga acc ate ate 
Tyr Phe Gly Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr lie He 
195 200 205 

ate aag ggc tat gtg cct ccc aca ggc aag age ttt get ate aac ttc 731 
He Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala He Asn Phe 
210 ^ 215 220 225 

aag gtg ggc tec tea ggg gac ata get ctg cac att aat ccc cgc atg 779 
Lys Val Gly Ser Ser Gly Asp He Ala Leu His lie Asn Pro. Arg Met 
230 235 240 



ggc aac ggt acc gtg gtc egg aac age ctt ctg aat ggc teg tgg gga 
Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp Gly 
245 250 255 



587 



635 



683 



827 



tec gag gag aag aag ate acc cac aac cca ttt ggt ccc gga cag ttc 875 
Ser Glu Glu Lys Lys He Thr His Asn Pro Phe Gly Pro Gly Gin Phe 
260 265 -270 

ttt gat ctg tec att cgc tgt ggc ttg gat cgc ttc aag gtt tac gee 923 
Phe Asp Leu Ser He Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr Ala 
275 280 285 

aat ggc cag cac etc ttt gac ttt gee cat cgc etc teg gec ttc cag 971 
Asn Gly Gin His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe Gin 
290 295 300 305 

agg gtg gac aca ttg gaa ate cag ggt gat gtc acc ttg tec tat gtc 1019 
Arg Val Asp Thr Leu Glu He Gin Gly Asp Val Thr Leu Ser Tyr Val 
310 315 320 

cag ate taatctattc ctggggccat aactcatggg aaaacagaat tatcccctag 1075 
Gin He 

gactcctttc taagccccta ataaaatgtc tgagggtgtc tc 1117 

<210> 58 

<211> 323 

<212> PRT 

<213> NMJ)06149 galectin 4, LGALS4 



<400> 58 
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Met Ala Tyr Val Pro Ala Pro Gly Tyr Gin Pro Thr Tyr Asn Pro Thr 
15 10 15 



Leu Pro Tyr Tyr Gin Pro He Pro Gly Gly Leu Asn Val Gly Met Ser 
20 25 30 



Val Tyr He Gin Gly Val Ala Ser Glu His Met Lys Arg Phe Phe Val 
35 .40 45 



Asn Phe Val Val Gly Gin Asp Pro Gly Ser Asp Val -Ala Phe His Phe 
50 55 60 



Asn Pro Arg Phe Asp Gly Trp Asp Lys Val Val Phe Asn Thr Leu Gin 
65 " 70 75 80 



Gly Gly Lys Trp Gly Ser Glu Glu Arg Lys Arg Ser Met Pro Phe Lys 
85 90 95 



Lys Gly Ala Ala Phe Glu Leu Val Phe He Val Leu Ala Glu His Tyr 
100 105 110 



Lys Val Val Val Asn Gly Asn Pro Phe Tyr Glu Tyr Gly His Arg Leu 
115 120 125 



Pro Leu Gin Met Val Thr His Leu Gin Val Asp Gly Asp Leu Gin Leu 
130 135 140 



Gin Ser He Asn Phe He Gly Gly Gin Pro Leu Arg Pro Gin Gly Pro 
145 150 155 160 



Pro Met Met Pro Pro Tyr Pro Gly Pro Gly His Cys His Gin Gin Leu 
165 170 175 



Asn Ser Leu Pro Thr Met Glu Gly Pro Pro Thr Phe Asn Pro Pro Val 
180 185 190 



Pro Tyr Phe Gly Arg Leu Gin Gly Gly Leu Thr Ala Arg Arg Thr lie 
195 200 205 



lie He Lys Gly Tyr Val Pro Pro Thr Gly Lys Ser Phe Ala He Asn 
210 215 220 



Phe Lys Val Gly Ser Ser Gly Asp He Ala Leu His He Asn Pro Arg 
225 230 235 240 



Met Gly Asn Gly Thr Val Val Arg Asn Ser Leu Leu Asn Gly Ser Trp 
245 250 255 



Gly Ser Glu Glu Lys Lys He Thr His Asn Pro Phe Gly Pro Gly Gin 
260 265 270 
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Phe Phe Asp Leu Ser lie Arg Cys Gly Leu Asp Arg Phe Lys Val Tyr 
275 280 285 



Ala Asn Gly Gin His Leu Phe Asp Phe Ala His Arg Leu Ser Ala Phe 
290 295 300 



Gin Arg Val Asp Thr Leu Glu lie Gin Gly Asp Val Thr Leu Ser Tyr 
305 310 315 320 

Val Gin He 

<210> 59 

<211> 3697 

<212> DNA 

<213> NM_004063 cadherin 17, CDH17 



<220> 

<221> CDS 

<222> (121) . . (2616) 

<223> 



<400> 59 

agggagtgtt cccgggggag atactccagt cgtagcaaga gtctcgacca ctgaatggaa 60 

gaaaaggact tttaaccacc attttgtgac ttacagaaag gaatttgaat aaagaaaact 120 

atg ata ctt cag gcc cat ctt cac tec ctg tgt ctt ctt atg ctt tat 168 
Met lie Leu Gin Ala His Leu His Ser Leu Cys Leu Leu Met Leu Tyr 
15 10 15 

ttg gca act gga tat ggc caa gag ggg aag ttt agt gga ccc ctg aaa 216 
Leu Ala Thr Gly Tyr Gly Gin Glu Gly Lys Phe Ser Gly Pro Leu Lys 
20 25 30 

ccc atg aca ttt tct att tat gaa ggc caa gaa ccg agt caa att ata 264 
Pro Met Thr Phe Ser He Tyr Glu Gly Gin Glu Pro Ser Gin He He 
35 40 45 

ttc cag ttt aag gcc aat cct cct get gtg act ttt gaa eta act ggg 312 
Phe Gin Phe Lys Ala Asn Pro Pro Ala Val Thr Phe Glu Leu Thr Gly 
50 55 60 

gag aca gac aac ata ttt gtg ata gaa egg gag gga ctt ctg tat tac 360 
Glu Thr Asp Asn He Phe Val He Glu Arg Glu Gly Leu Leu Tyr Tyr 
65 70 75 80 

aac aga gcc ttg gac agg gaa aca aga tct act cac aat etc cag gtt 408 
Asn Arg Ala Leu Asp Arg Glu Thr Arg Ser Thr His Asn Leu Gin Val 
85 90 95 

gca gcc ctg gac get aat gga att ata gtg gag ggt cca gtc cct ate 456 
Ala Ala Leu Asp Ala Asn Gly He He Val Glu Gly Pro Val Pro He 



WO 2004/022778 



PCT/AU2003/001166 



147 



100 105 110 

acc ata gaa gtg aag gac ate aac gac aat cga ccc acg ttt etc cag 504 
Thr lie Glu Val Lys Asp He Asn Asp Asn Arg Pro Thr Phe Leu Gin 
115 120 125 

tea aag tac gaa ggc tea gta agg cag aac tct cgc cca gga aag ccc 552 
Ser Lys Tyr Glu Gly Ser Val Arg Gin Asn Ser Arg Pro Gly Lys Pro 
130 135 140 

ttc ttg tat gtc aat gec aca gac ctg gat gat ccg gee act ccc aat 600 
Phe Leu Tyr Val Asn Ala Thr Asp Leu Asp Asp Pro Ala Thr Pro Asn 
145 150 155 160 

ggc cag ctt tat tac cag att gtc ate cag ctt ccc atg ate aac aat 648 
Gly Gin Leu Tyr Tyr Gin He Val He Gin Leu Pro Met He Asn Asn 
165 170 175 

gtc atg tac ttt cag ate aac aac aaa acg gga gec ate tct ctt acc 696 
Val Met Tyr Phe Gin He Asn Asn Lys Thr Gly Ala He Ser Leu Thr 
180 185 . • . 190 

cga gag gga tct cag gaa ttg aat cct get aag aat cct tec tat aat 744 
Arg Glu Gly Ser Gin Glu Leu Asn Pro Ala Lys Asn Pro Ser Tyr Asn 
195 200 205 

ctg gtg ate tea gtg aag gac atg gga ggc cag agt gag aat tec ttc 792 
Leu Val He Ser Val Lys Asp Met Gly Gly Gin Ser Glu Asn Ser Phe 
210 215 220 

agt gat acc aca tct gtg gat ate ata gtg aca gag aat att tgg aaa 840 
Ser Asp Thr Thr Ser Val Asp He He Val Thr Glu Asn He Trp Lys 
225 230 235 240 

gca cca aaa cct gtg gag atg gtg gaa aac tea act gat cct cac ccc 888 
Ala Pro Lys Pro Val Glu Met Val Glu Asn Ser Thr Asp Pro His Pro 
245 250 25-5 

ate aaa ate act cag gtg egg tgg aat gat ccc ggt gca caa tat tec 936 
He Lys He Thr Gin Val Arg Trp Asn Asp Pro Gly Ala Gin Tyr Ser 
260 265 270 

tta gtt gac aaa gag aag ctg cca aga ttc cca ttt tea att gac cag 984 
Leu Val Asp Lys Glu Lys Leu Pro Arg Phe Pro Phe Ser He Asp Gin 
275 280 285 

gaa gga gat att tac gtg act cag ccc ttg gac cga gaa gaa aag gat 1032 
Glu Gly Asp He Tyr Val Thr Gin Pro Leu Asp Arg Glu Glu Lys Asp 
290 295 300 

gca tat gtt ttt tat gca gtt gca aag gat gag tac gga aaa cca ctt 1080 
Ala Tyr Val Phe Tyr Ala Val Ala Lys Asp Glu Tyr Gly Lys Pro Leu 
305 310 ' 315 ~ 320 

tea tat ccg ctg gaa att cat gta aaa gtt aaa gat att aat gat aat 1128 
Ser Tyr Pro Leu Glu He His Val Lys Val Lys Asp He Asn Asp Asn 
325 330 335 

cca cct aca tgt ccg tea cca gta acc gta ttt gag gtc cag gag aat 1176 
Pro Pro Thr Cys Pro Ser Pro Val Thr Val Phe Glu Val Gin Glu Asn 
340 345 350 

gaa cga ctg ggt aac agt ate ggg acc ctt act gca cat gac agg gat 1224 
Glu Arg Leu Gly Asn Ser He Gly Thr Leu Thr Ala His Asp Arg Asp 
355 360 365 

gaa gaa aat act gee aac agt ttt eta aac tac agg att gtg gag caa 1272 
Glu Glu Asn Thr Ala Asn Ser Phe Leu Asn Tyr Arg He Val Glu Gin 
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370 375 ■ 380 

act ccc aaa ctt ccc atg gat gga etc ttc eta ate caa acc tat get 1320 
Thr Pro Lys Leu Pro Met Asp Gly Leu Phe Leu lie Gin Thr Tyr Ala 
385 390 395 400 

gga atg tta cag tta get aaa cag tec ttg aag aag caa gat act cct 1368 
Gly Met Leu Gin Leu Ala Lys Gin Ser Leu Lys Lys Gin Asp Thr Pro 
405 410 415 

cag tac aac tta acg ata gag gtg tct gac aaa gat ttc aag acc ctt 1416 
Gin Tyr Asn Leu Thr He Glu Val Ser Asp Lys Asp Phe Lys Thr Leu 
420 425 430 

tgt ttt gtg caa ate aac gtt att gat ate aat gat cag ate ccc ate 1464 
Cys Phe Val Gin He Asn Val He Asp He Asn Asp Gin He Pro He 
435 440 445 

ttt gaa aaa tea gat tat gga aac ctg act ctt get gaa gac aca aac 1512 
Phe Glu Lys Ser Asp Tyr Gly Asn Leu Thr Leu Ala Glu Asp Thr Asn 
450 455 460 

att ggg tec acc ate tta acc ate cag gee act gat get gat gag cca 1560 
He Gly Ser Thr He Leu Thr He Gin Ala Thr Asp Ala Asp Glu Pro 
465 470 475 480 

ttt act ggg agt tct aaa att ctg tat cat ate ata aag gga gac agt 1608 
Phe Thr Gly Ser Ser Lys He Leu Tyr His He He Lys Gly Asp Ser 
485 490 " 495 

gag gga cgc ctg ggg gtt gac aca gat ccc cat acc aac acc gga tat 1656 
Glu Gly Arg Leu Gly Val Asp Thr Asp Pro His Thr Asn Thr Gly Tyr 
500 505 510 

gtc ata att aaa aag cct ctt gat ttt gaa aca gca get gtt tec aac 1704 
Val He He Lys Lys Pro Leu Asp Phe Glu Thr Ala Ala Val Ser Asn 
515 520 525 

att gtg ttc aaa gca gaa aat cct gag cct eta gtg ttt ggt gtg aag 1752 
He Val Phe Lys Ala Glu Asn Pro Glu Pro Leu Val Phe Gly Val Lys 
530 535 540 

tac aat gca agt tct ttt gec aag ttc acg ctt att gtg aca gat gtg 1800 
Tyr Asn Ala Ser Ser Phe Ala Lys Phe Thr Leu lie Val Thr Asp Val 
545 550 555 560 

aat gaa gca cct caa ttt tec caa cac gta ttc caa gcg aaa gtc agt 1848 
Asn Glu Ala Pro Gin Phe Ser Gin His Val Phe Gin Ala Lys Val Ser 
565 570 575 

gag gat gta get ata ggc act aaa gtg ggcaat gtg act gec aag gat 18 96 
Glu Asp Val Ala He Gly Thr Lys Val Gly Asn Val Thr Ala Lys Asp 
580 585 590 

cca. gaa ggt ctg gac ata age tat tea ctg agg. gga gac aca aga ggt 1944 
Pro Glu Gly Leu Asp He Ser Tyr Ser Leu Arg Gly Asp Thr Arg Gly 
595 600 605 

tgg ctt aaa att gac cac gtg act ggt gag ate ttt agt gtg get cca 1992 
Trp Leu Lys He Asp His Val Thr Gly Glu lie Phe Ser Val Ala Pro 
610 615 620 

ttg gac aga gaa gec gga agt cca tat egg gta caa gtg gtg gec aca 2040 
Leu Asp Arg Glu Ala Gly Ser Pro Tyr Arg Val Gin Val Val Ala Thr 
625 630 635 640 

gaa gta ggg ggg tct tec ttg age tct gtg tea gag ttc cac ctg ate 2088 
Glu Val Gly Gly Ser Ser Leu Ser Ser Val Ser Glu Phe His Leu He 
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645 .650 655 

ctt atg gat gtg aat gac aac cct ccc agg eta gec aag gac tac acg 2136 

Leu Met Asp Val Asn Asp Asn Pro Pro Arg Leu Ala Lys Asp Tyr Thr 
660 665 670 

ggc ttg ttc ttc tgc cat ccc etc agt gca cct gga agt etc att ttc 2184 

Gly Leu Phe Phe 'Cys His Pro Leu Ser Ala Pro Gly Ser Leu lie Phe 
675 680 685 



gag get act gat gat gat cag cac tta ttt egg ggt ccc cat ttt aca 2232 

Glu Ala Thr Asp Asp Asp Gin His Leu Phe Arg Gly Pro His Phe Thr 

690 695 700 

ttt tec etc ggc agt gga age tta caa aac gac tgg gaa gtt tec aaa 2280 

Phe Ser Leu Gly Ser Gly Ser Leu Gin Asn Asp Trp Glu Val Ser Lys 

705 710 715 720 

ate aat ggt act cat gee cga ctg tct acc agg cac aca gag ttt gag 2328 

lie Asn Gly Thr His Ala Arg Leu Ser Thr Arg His Thr Glu Phe Glu 
725 730 735 

gag agg gag tat gtc gtc ttg ate egcate aat gat ggg ggt egg cca 2376 

Glu Arg Glu Tyr Val Val Leu lie Arg lie Asn Asp Gly Gly Arg Pro 
740 745 750 

ccc ttg gaa ggc att gtt tct tta cca gtt aca ttc tgc agt tgt gtg 2424 

Pro Leu Glu Gly He Val Ser Leu Pro Val Thr Phe Cys Ser Cys Val 
755 760 765 

gaa gga agt tgt ttc egg cca gca ggt cac cag act ggg ata ccc act 2472 

Glu Gly Ser Cys Phe Arg Pro Ala Gly His Gin Thr Gly He Pro Thr 

770 775 780 

gtg ggc atg gca gtt ggt ata ctg ctg acc acc ctt ctg gtg att ggt 2520 

Val Gly Met Ala Val Gly He Leu Leu Thr Thr Leu Leu Val He' Gly 

785 790 795 800 

ata att tta gca gtt gtg ttt ate cgc ata aag aag gat aaa ggc aaa 2568 

He He Leu Ala Val Val Phe He Arg He Lys Lys Asp Lys Gly Lys 
805 810 815 



gat aat gtt gaa agt get caa gca tct gaa gtc aaa cct ctg aga age 2 616 

Asp Asn Val Glu Ser Ala Gin Ala Ser Glu Val Lys Pro Leu Arg Ser 
820 825 830 



tgaatttgaa 


aaggaatgtt 


tgaatttata 


tagcaagtgc 


tatttcagca 


acaaccatct 


2676 


catcctatta 


cttttcatct 


aacgtgcatt 


ataatttttt 


aaacagatat 


tccctcttgt 


2736 


cctttaatat 


ttgetaaata 


tttctttttt 


gaggtggagt 


ettgetctgt 


cgcccaggct 


2796 


ggagtacagt 


ggtgtgatcc 


cagctcactg 


caacctccgc 


ctcctgggtt 


cacatgattc 


2856 


tcctgcctca 


gcttcctaag 


tagctgggtt 


tacaggcacc 


caccaccatg 


cccagctaat 


2916 


ttttgtattt 


ttaatagaga 


eggggttteg 


ccatttggcc 


aggctggtct 


tgaactcctg 


2976 


aegtcaagtg 


atctgcctgc 


cttggtctcc 


caatacaggc 


atgaaccact 


gcacccacct 


3036 


acttagatat 


ttcatgtgct 


atagacatta 


gagagatttt 


tcatttttcc 


atgacatttt 


3096 


tcctctctgc 


aaatggctta 


gctacttgtg 


tttttccctt 


ttggggcaag 


acagactcat 


3156 


taaatattct 


gtacattttt 


tctttatcaa 


ggagatatat 


cagtgttgtc 


tcatagaact 


3216 


gectggatte 


catttatgtt 


ttttctgatt 


ccatcctgtg 


tccccttcat 


ccttgactcc 


3276 
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tttggtattt cactgaattt caaacatttg tcagagaaga aaaacgtgag gactcaggaa 3336 

aaataaataa ataaaagaac agccttttcc cttagtatta acagaaatgt ttctgtgtca 3396 

ttaaccatct ttaatcaatg tgacatgttg ctctttggct gaaattcttc aacttggaaa 3456 

tgacacagac ccacagaagg tgttcaaaca caacctactc tgcaaacctt ggtaaaggaa 3516 

ccagtcagct ggccagattt cctcactacc tgccatgcat acatgctgcg catgttttct 3576 

tcattcgtat gttagtaaag ttttggttat tatatattta acatgtggaa gaaaacaaga 3636 

catgaaaaga gtggtgacaa atcaagaata aacactggtt gtagtcagtt ttgtttgtta 3696 

a 3597 

<210> 60 
<211> 832 
<212> PRT 

<213> NM_004063 cadherin 17, CDH17 
<400> 60 

Met He Leu Gin Ala His Leu His Ser Leu Cys Leu Leu Met Leu Tyr 
15 10 15 

Leu Ala Thr Gly Tyr Gly Gin Glu Gly Lys Phe Ser Gly Pro Leu Lys 
20 25 30 

Pro Met Thr Phe Ser He Tyr Glu Gly Gin Glu Pro Ser Gin He He 
35 40 45 

Phe Gin Phe Lys Ala Asn Pro Pro Ala Val Thr Phe Glu Leu Thr Gly 
50 55 60 

Glu Thr Asp Asn He Phe Val He Glu Arg Glu Gly Leu Leu Tyr Tyr 
65 70 75 • 80 

Asn Arg Ala Leu Asp Arg Glu Thr Arg Ser Thr His Asn Leu Gin Val 
85 90 .95 

Ala Ala Leu Asp Ala Asn Gly He He Val Glu Gly Pro Val Pro He 
100 105 110 

Thr He Glu Val Lys Asp He Asn Asp Asn Arg Pro Thr Phe Leu Gin 
115 120 125 

Ser Lys Tyr Glu Gly Ser Val Arg Gin Asn Ser Arg Pro Gly Lys Pro 
130 135 140 



Phe Leu Tyr Val Asn Ala Thr Asp Leu Asp Asp Pro Ala Thr Pro Asn 
145 150 155 160 
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Gly Gin Leu Tyr Tyr Gin lie Val He Gin Leu Pro Met He Asn Asn 
165 170 175 



Val Met Tyr Phe Gin He Asn Asn Lys Thr Gly Ala He Ser Leu Thr 
180 185 190 



Arg Glu Gly Ser Gin Glu Leu Asn Pro Ala Lys Asn Pro Ser Tyr Asn 
195 200 205 



Leu Val He Ser Val Lys Asp Met Gly Gly Gin Ser Glu Asn Ser Phe 
210 215 220 



Ser Asp Thr Thr Ser Val Asp He He Val Thr Glu Asn He Trp Lys 
225 230 235 240 



Ala Pro Lys Pro Val Glu Met Val Glu Asn Ser Thr Asp Pro His Pro 
245 250 255 

He Lys He Thr Gin Val Arg Trp Asn Asp Pro Gly Ala Gin Tyr Ser 
260 265 270 

Leu Val Asp Lys Glu Lys Leu Pro Arg Phe Pro Phe Ser He Asp Gin 
275 280 285 



Glu Gly Asp He Tyr Val Thr Gin Pro Leu Asp Arg Glu Glu Lys Asp 
290 295 300 



Ala Tyr Val Phe Tyr Ala Val Ala Lys Asp Glu Tyr Gly Lys Pro Leu 
305 310 315 " 320 

Ser Tyr Pro Leu Glu He His Val Lys Val Lys Asp He Asn Asp Asn 
325 330 335 



Pro Pro Thr Cys Pro Ser Pro Val Thr Val Phe Glu Val Gin Glu Asn 
340 345 350 



Glu Arg Leu Gly Asn Ser He Gly Thr Leu Thr Ala His Asp Arg Asp 
355 360 365 



Glu Glu Asn Thr Ala Asn Ser Phe Leu Asn Tyr Arg He Val Glu Gin 
370 375 380 



Thr Pro Lys Leu Pro Met Asp Gly Leu Phe Leu lie Gin Thr Tyr Ala 
385 390 395 400 



Gly Met Leu Gin Leu Ala Lys Gin Ser Leu Lys Lys Gin Asp Thr Pro 
405 410 * ~ 415 



Gin Tyr Asn Leu Thr He Glu Val Ser Asp Lys Asp Phe Lys Thr Leu 
420 425 430 
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Cys Phe Val Gin lie Asn Val lie Asp He Asn Asp Gin He Pro He 
435 440 * 445 



Phe Glu Lys Ser Asp Tyr Gly Asn Leu Thr Leu Ala Glu Asp Thr Asn 
450 455 4 60 



He Gly Ser Thr He Leu Thr He Gin Ala Thr Asp Ala Asp Glu Pro 
465 470 475 480 

Phe Thr Gly Ser Ser Lys He Leu Tyr His He He Lys Gly Asp Ser 
485 490 495 

Glu Gly Arg Leu. Gly Val Asp Thr Asp Pro His Thr Asn Thr Gly Tyr 
500 505 510 

Val He He Lys Lys Pro Leu Asp Phe Glu Thr Ala Ala Val Ser Asn 
515 520 525 

He Val Phe Lys Ala Glu Asn Pro Glu . Pro Leu Val Phe Gly Val Lys 
53 0 535 540 

Tyr Asn Ala Ser Ser Phe Ala Lys Phe Thr Leu He Val Thr Asp Val 
545 550 555 560 

Asn Glu Ala Pro Gin Phe Ser Gin His Val Phe Gin Ala Lys Val Ser 
565 570 575 

Glu Asp Val Ala He Gly Thr Lys Val Gly Asn Val Thr Ala Lys Asp 
580 585 590 

Pro Glu Gly Leu Asp lie Ser Tyr Ser Leu Arg Gly Asp Thr Arg Gly 
595 600 605 

Trp Leu Lys He Asp His Val Thr Gly Glu He Phe Ser Val Ala Pro 
610 615 ' 620 

Leu Asp Arg Glu Ala Gly Ser Pro Tyr Arg Val Gin Val Val Ala Thr 
625 630 635 640 

Glu Val Gly Gly Ser Ser Leu Ser Ser Val Ser Glu Phe His Leu He 
645 650 655 

Leu Met Asp Val Asn Asp Asn Pro Pro Arg Leu Ala Lys Asp Tyr Thr 
660 665 . 670 

Gly Leu Phe Phe Cys His Pro Leu Ser Ala Pro Gly Ser Leu He Phe 
675 680 685 



Glu Ala Thr Asp Asp Asp Gin His Leu Phe Arg Gly Pro His Phe Thr 
690 695 700 
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Phe Ser Leu Gly Ser Gly Ser Leu Gin Asn Asp Trp Glu Val Ser Lys 
705 710 715 720 



He Asn Gly Thr His Ala Arg Leu Ser Thr Arg His Thr Glu Phe Glu 
725 730 735 



Glu Arg Glu Tyr Val Val Leu He Arg He Asn Asp . Gly Gly Arg Pro 
740 745 - 750 



Pro Leu Glu Gly He Val Ser Leu Pro Val Thr Phe Cys Ser Cys Val 
755 ' 760 765 



Glu Gly Ser Cys Phe Arg Pro Ala Gly His Gin Thr Gly He Pro Thr 
770 775 780 



Val Gly Met Ala Val Gly He Leu Leu Thr Thr Leu Leu Val He Gly 
785 790 79.5 . 800 



He He Leu Ala Val Val Phe He Arg He Lys Lys Asp Lys Gly Lys 
805 810 815 



Asp Asn Val Glu Ser Ala Gin Ala Ser Glu Val Lys Pro Leu Arg Ser 
820 825 830 



<210> 61 

<211> 2920 

<212> DNA 

<213> NM_005588; meprin A, alpha 
<220> 

<221> CDS 

<222> (10) . . (2247) 

<223> 



<400> 61 

cttgcagca atg get tgg att aga tec act tgc att etc ttt ttt acc ttg 51 

Met Ala Trp He Arg Ser Thr Cys He Leu Phe Phe Thr Leu 
1 5 10 

ctt ttt gec cac ata gca get gta ccg att aag cat ctt cct gaa gaa 99 
Leu Phe Ala His He Ala Ala Val Pro He Lys His Leu Pro Glu Glu 
15 20 25 30 

aat gta cat gat gca gat ttt ggt gaa cag aag gat att tea gaa ate 147 
Asn Val His Asp Ala Asp Phe Gly Glu Gin Lys Asp He Ser Glu He 
35 40 45 



aat tta get gca ggc ttg gac etc ttt caa ggg gac ate etc ttg cag 
Asn Leu Ala Ala Gly Leu Asp Leu Phe Gin Gly Asp He Leu Leu Gin 



195 
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50 55 60 

aaa tec aga aat ggc ctg aga gac cca aac acc agg tgg acg ttc ccc 243 
Lys Ser Arg Asn Gly Leu Arg Asp Pro Asn Thr Arg Trp Thr Phe Pro 
65 70 . 75 

att cct tac ate ttg get gat aat ttg ggg ctg aat get aaa gga gee 291 
lie Pro Tyr He Leu Ala Asp Asn Leu Gly Leu Asn Ala Lys Gly Ala 
80 85 90 

att ctg tat gee ttt gag atg ttc cgt etc aag tec tgt gtg gat ttc 339 
He Leu Tyr Ala Phe Glu Met Phe Arg Leu Lys Sef Cys Val Asp Phe 
95 100 105 * 110 

aag ccc tat gaa gga gag age tea tat ate ata ttt caa cag ttt gat 387 
Lys Pro. Tyr Glu Gly Glu Ser Ser Tyr He He Phe Gin Gin Phe Asp 
115 120 125 

ggg tgc tgg tct gag gtt ggt gac caa cat gtg gga cag aac att tec 435 
Gly Cys Trp Ser Glu Val Gly Asp Gin His Val Gly Gin Asn He Ser 
130 135 140 

att ggc caa gga tgt gec tat aag gee ate ata gaa cac gag ate ctg 483 
He Gly Gin Gly Cys Ala Tyr Lys Ala He He Glu His Glu He Leu 
145 .150 155 

cat get ttg gga ttt tac cac gag cag tea agg acg gac egg gat gat 531 
His Ala Leu Gly Phe Tyr His Glu Gin Ser Arg Thr Asp Arg Asp Asp 
160 165 170 

tat gtg aac ate tgg tgg gac caa att ctt tea ggt tac cag cac aac 579 
Tyr Val Asn lie Trp Trp Asp Gin He Leu Ser Gly Tyr Gin His Asn 
175 180 185 190 

ttt gac acc tat gat gat age tta ate aca gac etc aat aca ccc tat 627 
Phe Asp Thr Tyr Asp Asp Ser Leu He Thr Asp Leu Asn Thr Pro Tyr 
195 200 205 

gat tat gag tct ttg atg cac tac cag cct ttc tea ttt aac aag aat 675 
Asp Tyr Glu Ser Leu Met His Tyr Gin Pro Phe Ser Phe Asn Lys Asn 
210 215 . 220 

gca agt gtt ccc acc ate aca gec aag ate cct gag ttt aac tec att 723 
Ala Ser Val Pro Thr He Thr Ala Lys He Pro Glu Phe Asn Ser He 
225 230 235 

ate gga caa cgc ctg gat ttc agt gee att gat tta gag agg ctg aac 771 
He Gly Gin Arg Leu Asp Phe Ser Ala He Asp Leu Glu Arg Leu Asn 
240 245 . 250 

cga atg tac aat tgc acc aca act cac act ctt ttg gac cac tgt act 819 
Arg Met Tyr Asn Cys Thr Thr Thr His Thr Leu Leu Asp His Cys Thr 
255 260 265 270 

ttt gag aag gca aac ate tgt gga atg att cag ggc acc aga gat gac 8 67 

Phe Glu Lys Ala Asn He Cys Gly Met He Gin Gly Thr Arg Asp Asp 
275 280 285 

act gac tgg gee cat cag gac agt get cag get gga gaa gtg gat cac 915 
Thr Asp Trp Ala His Gin Asp Ser Ala Gin Ala Gly Glu Val Asp His 
290 295 300 

acc ttg ttg gga caa tgc aca ggt gee ggc tac ttc atg cag ttc age 963 
Thr Leu Leu Gly Gin Cys Thr Gly Ala Gly Tyr Phe Met Gin Phe Ser 
305 310 315 



acc age teg ggg tec gcg gaa gag gca gee eta ctg gag tct egg att 1011 
Thr Ser Ser Gly Ser Ala Glu Glu Ala Ala Leu Leu Glu Ser Arg lie 
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320 325 330 

ctt tac cca aag agg aag cag cag tgc ctg caa ttt ttc tat aaa atg 1059 
Leu Tyr Pro Lys Arg Lys Gin Gin Cys Leu Gin Phe Phe Tyr Lys Met 
335 340 345 350 

acg gga agt cct tea gac aga etc gtt gtc tgg gtc agg agg gat gac 1107 
Thr Gly Ser Pro Ser Asp Arg Leu Val Val Trp Val Arg Arg Asp Asp 
355 360 365 

age aca ggc aat gtt cgc aag ttg gtg aag gtg cag act ttt caa gga 1155 
Ser Thr Gly Asn Val Arg Lys Leu Val Lys Val Gin Thr Phe Gin Gly 
370 375 380 

gat gat gac cac aat tgg aaa att gee cat gtg gtg etc aaa gag gaa 1203 
Asp Asp Asp His Asn Trp Lys lie Ala His Val Val Leu Lys Glu Glu 
385 390 395 

cag aag ttt cgc tac ctt ttc cag ggc aca* aaa ggc gac cct cag aac 1251 
Gin Lys Phe Arg Tyr Leu Phe Gin Gly Thr Lys Gly Asp Pro Gin Asn 
400 405 410 

tea act ggg gga att tac eta gat gac ate act ctg aca gaa ace ccc 1299 
Ser Thr Gly Gly He Tyr Leu Asp Asp He Thr Leu Thr Glu Thr Pro 
415 420 425 430 

tgc ccc aca ggg gtc tgg aca gtc egg aat ttc tec caa gtc ctt gag 1347 
Cys Pro Thr Gly Val Trp Thr Val Arg Asn Phe Ser Gin Val Leu Glu 
435 440 445 

aac ace age aaa ggg gac aag ctt cag age cct cga ttc tac aat teg 1395 
Asn Thr Ser Lys Gly Asp Lys Leu Gin Ser Pro Arg Phe Tyr Asn Ser 
450 455 460 

gag gga tat ggt ttt ggg gta act tta tac cca aat age aga gaa age 1443 
Glu Gly Tyr Gly Phe Gly Val Thr Leu Tyr Pro Asn Ser Arg Glu Ser 
465 470 475 

tct ggt tac ttg aga ctt get ttt cat gtg tgc agt ggg gag aac gat 14 91 
Ser Gly Tyr Leu Arg Leu Ala Phe His Val Cys Ser Gly Glu Asn Asp 
480 485 490 

get ate ctg gag tgg ccg gta gaa aac aga cag gtg ata att ace ate 1539 
Ala He Leu Glu Trp Pro Val Glu Asn Arg Gin Val He He Thr He 
495 500 505 510 

ctt gac cag gag cct gat gtc egg aac agg atg tec tea age atg gtg 1587 
Leu Asp Gin Glu Pro Asp Val Arg Asn Arg Met Ser Ser Ser Met Val 
515 520 525 

ttc act ace teg aag teg cac aca tct cca gcg ata aat gac act gtc 1635 
Phe Thr Thr Ser Lys Ser His Thr Ser Pro Ala He Asn Asp Thr Val 
530 535 540 

ate tgg gac agg ccg tec agg gtg gga ace tat cat aca gac tgt aat 1683 
He Trp Asp Arg Pro Ser Arg Val Gly Thr Tyr His Thr Asp Cys Asn 
545 550 555 

tgt ttt aga age ate gac ttg ggc tgg agt ggt ttc att tec cac caa 173i 
Cys Phe Arg Ser He Asp Leu Gly Trp Ser Gly Phe He Ser His Gin 
560 565 570 

atg ctg aaa agg agg agt ttc ctg aaa aat gat gac etc ate ata ttt 1779 
Met Leu Lys Arg Arg Ser Phe Leu Lys .Asn Asp Asp Leu He He Phe 
575 580 585 590 

gtg gac ttt gaa gat ate acc cac etc age cag act gaa gtt ccc tct 1827 
Val Asp Phe Glu Asp He Thr His Leu Ser Gin Thr Glu Val Pro Ser 
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595 600 605 

aaa ggc aaa aga ctg age ccc caa ggc etc att etc caa ggc cag gag 1875 

Lys Gly Lys Arg Leu Ser Pro Gin Gly Leu lie Leu Gin Gly Gin Glu 
610 615 620 

cag cag gtc tec gaa gaa ggt teg gga aag gec atg tta gag gaa gec 1923 

Gin Gin Val Ser Glu Glu Gly Ser Gly Lys Ala Met Leu Glu Glu Ala 

625 630 635 

eta cct gtc age ctg age cag ggg cag ccc age cga cag aag egg teg 1971 

Leu Pro Val Ser Leu Ser Gin Gly Gin Pro Ser Arg Gin Lys Arg Ser 
640 645 650 

gtg gag aac aca ggc ccc ctg gag gac cat aac tgg cca cag tac ttc 2019 

. Val Glu Asn Thr Gly Pro Leu Glu Asp His Asn Trp Pro Gin Tyr Phe 
655 660 665 670 

aga gac cca tgt gac cca aac cct tgc caa aat gac ggc ate tgt gtg • 2067 

Arg Asp Pro Cys Asp Pro Asn Pro Cys Gin Asn Asp Gly lie Cys Val 
675 680 685 

aac gtg aag ggg atg gcg age tgc agg tgc ate tct .gga cat get ttc 2115 

Asn Val Lys Gly Met Ala Ser Cys Arg Cys lie Ser Gly His Ala Phe 
690 695 700 

ttc tac acg ggg gag cgc tgt cag teg gec gag gtg cac ggc agt gtc 2163 

Phe Tyr Thr Gly Glu Arg Cys Gin Ser Ala Glu Val His Gly Ser Val 

705 710 715 

ctg ggc atg gtg ate gga ggc acg get ggc gtg ate ttc ttg ace ttc 2211 

Leu Gly Met Val lie Gly Gly Thr Ala Gly Val. lie Phe Leu Thr Phe 
720 725 730 

tec ate ate gec ate ctt tec caa agg cca agg aag tgacctgcct 2257 

Ser lie lie Ala lie Leu Ser Gin Arg Pro Arg Lys 



735 


740 


745 








gctggcattg 


gccagaccac 


agcagcacct 


cctccatgca 


ggecttaact 


ttcccatgtt 


2317 


caatgeagtt 


tggggcagct 


tttttatcag 


ecttgetttg 


gataggacct 


ccaaggacta 


2377 


agcctccagc 


cccatgtgtg 


acccttgtca 


tctctctgcc 


ccacataatt 


atgttacttt 


2437 


gctatgtgct 


cctaatgtat 


ctagtgtgtc 


ctgtgacaac 


actcatcaca 


cttcattgta 


2497 


aatcacttgt 


tttattgact 


gtctttccta 


tagactgtaa 


gctccatgag 


ggcaggcaca 


2557 


tgttgttctc 


attgacegtg 


ctggccccag 


tgcctagatg 


catggctggc 


acattgttgg 


2617 


cactcaacaa 


tggttgaatg 


aataaaacaa 


taaatgaatg 


aataactaag 


atatagaaac 


2677 


tctcatttat 


attgeagatt 


gaatatatat 


gatgaaattc 


ttatgttgaa 


tatgttagaa 


2737 


tcaaatactc 


atttttcatt 


agatacagta 


gtgtcatcac 


tcttttaaga 


tcttgttaaa 


2797 


gatttcaaat 


aaaggtactt 


ctggcgagcc 


aggctgeaca 


geatttgett 


tcctctgaga 


2857 


ttctaagaga 


aggectttaa 


taaatttaat 


aaatattgag 


ttagcaaaaa 


aaaaaaaaaa 


2917 


aaa 












2920 



<210> 62 
<211> 746 
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<212> PRT 

<213> NMJ)05588; meprin A, alpha 
<400> 62 

Met Ala Trp lie Arg Ser Thr Cys He Leu Phe Phe Thr Leu Leu Phe 
1 5 10 15 

Ala His He Ala Ala Val Pro He Lys His Leu Pro Glu Glu Asn Val 
20 25 30 



His Asp Ala Asp Phe Gly Glu Gin 
35 40 



Ala Ala Gly Leu Asp Leu Phe Gin 
50 55 



Arg Asn Gly Leu Arg Asp Pro Asn 
65 70 



Tyr He Leu Ala Asp Asn Leu Gly 
85 



Lys Asp He Ser Glu He Asn Leu 
45 

Gly Asp He Leu Leu Gin Lys Ser 
60 

Thr Arg Trp Thr Phe Pro He Pro 
75 80 

Leu Asn Ala Lys Gly Ala He Leu 
90 95 



Tyr Ala Phe Glu Met Phe Arg Leu Lys Ser Cys Val Asp Phe Lys Pro 
100 105 HO 



Tyr Glu Gly Glu Ser Ser Tyr He He Phe Gin Gin Phe Asp Gly Cys 
115 120 125 



Trp Ser Glu Val Gly Asp Gin His Val Gly Gin Asn He Ser He Gly 
130 135 140 

Gin Gly Cys Ala Tyr Lys Ala He He Glu His Glu He Leu His Ala 
145 150 155 160 

Leu Gly Phe Tyr His Glu Gin Ser Arg Thr Asp Arg Asp Asp Tyr Val 
165 170 " 175 

Asn He Trp Trp Asp Gin He Leu Ser Gly Tyr Gin His Asn Phe Asp 
180 185 190 

Thr Tyr Asp Asp Ser Leu He Thr Asp Leu Asn Thr Pro Tyr Asp Tyr 
195 200 205 

Glu Ser Leu Met His Tyr Gin Pro Phe Ser Phe Asn Lys Asn Ala Ser 
210 215 220 



Val Pro Thr He Thr Ala Lys He Pro Glu Phe Asn Ser He He Gly 
225 230 235 240 
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Gin Arg Leu Asp phe Ser Ala lie Asp Leu Glu Arg Leu Asn Arg Met 
245 250 255 



Tyr Asn Cys Thr Thr Thr His Thr Leu Leu Asp His Cys Thr Phe Glu 
260 265 270 



Lys Ala Asn lie Cys Gly Met He Gin Gly Thr Arg Asp Asp Thr Asp 
275 280 285 



Trp Ala His Gin Asp Ser Ala Gin Ala Gly Glu Val Asp His Thr Leu 
290 295 300 



Leu Gly Gin Cys Thr Gly Ala Gly Tyr Phe Met Gin Phe Ser Thr Ser 
305 310 • 315 320 



Ser Gly Ser Ala Glu Glu Ala Ala Leu Leu Glu Ser Arg He Leu Tyr 
325 330 335 



Pro Lys Arg Lys Gin Gin Cys Leu Gin Phe Phe Tyr Lys Met Thr Gly 
340 345 350 



Ser Pro Ser Asp Arg Leu Val Val Trp Val Arg Arg Asp Asp Ser Thr 
355 360 365 



Gly Asn Val Arg Lys Leu Val Lys Val Gin Thr Phe Gin Gly Asp Asp 
370 375 380 



Asp His Asn Trp Lys He Ala His Val Val Leu Lys Glu Glu Gin Lys 
385 390 395 400 



Phe Arg Tyr Leu Phe Gin Gly Thr Lys Gly Asp Pro Gin Asn Ser Thr 
405 410 415 



Gly Gly He Tyr Leu Asp Asp lie Thr Leu Thr Glu Thr Pro Cys Pro 
420 425 430 



Thr Gly Val Trp Thr Val Arg Asn Phe Ser Gin Val Leu Glu Asn Thr 
435 440 445 



Ser Lys Gly Asp Lys Leu Gin Ser Pro Arg Phe Tyr Asn Ser Glu Gly 
450 455 460 



Tyr Gly Phe Gly Val Thr Leu Tyr Pro Asn Ser Arg Glu Ser Ser Gly 
465 470 475 480 



Tyr Leu Arg Leu Ala Phe His Val Cys Ser Gly Glu Asn Asp Ala He 
485 490 495 



Leu Glu Trp Pro Val Glu Asn Arg Gin Val lie He Thr He Leu Asp 
500 505 510 
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Gin Glu Pro Asp Val Arg Asn Arg Met Ser Ser Ser Met Val Phe Thr 
515 520 525 



Thr Ser Lys Ser His Thr Ser Pro Ala He Asn Asp Thr Val He Trp 
530 535 540 



Asp Arg Pro Ser Arg Val Gly Thr Tyr His Thr Asp Cys Asn Cys Phe 
545 550 555 560 



Arg Ser He Asp Leu Gly Trp Ser Gly Phe He Ser His Gin Met Leu 
565 570 575 



Lys Arg Arg Ser Phe Leu Lys Asn Asp Asp Leu He He Phe Val Asp 
580 585 590 



Phe Glu Asp tie Thr His Leu Ser Gin Thr Glu Val Pro Ser Lys Gly 
595 600 605 



Lys Arg Leu Ser Pro Gin Gly Leu He Leu Gin Gly Gin Glu Gin Gin 
610 615 620 



Val Ser Glu Glu Gly Ser Gly Lys Ala Met Leu Glu Glu Ala Leu Pro 
625 630 635 640 



Val Ser Leu Ser Gin Gly Gin Pro Ser Arg Gin Lys Arg Ser Val Glu 
645 650 655 



Asn Thr Gly Pro Leu Glu Asp His Asn Trp Pro Gin Tyr Phe Arg Asp 
660 665 670 



Pro Cys Asp Pro Asn Pro Cys Gin Asn Asp Gly He Cys Val Asn Val 
675 680 685 



Lys Gly Met Ala Ser Cys Arg Cys He Ser Gly His Ala Phe Phe Tyr 
690 695 700 



Thr Gly Glu Arg Cys Gin Ser Ala Glu Val His Gly Ser Val Leu Gly 
705 710 715 720 



Met Val He Gly Gly Thr Ala Gly Val He Phe Leu Thr Phe Ser He 
725 730 735 



He Ala He Leu Ser Gin Arg Pro Arg Lys 
740 745 



<210> 63 

<211> 8838 

<212> DNA 

<213> NM_015902 EDD, hyperplastic discs protein, DD5 
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<220> 

<221> CDS 

<222> (34) . . (8430) 

<223> 



<400> 63 

cgccctcgag tggaggacga gaaggaaagc acc atg acg tec ate cat ttc gtg 54 

. Met Thr Ser He His Phe Val 
15 

gtt cac ccg ctg ccg ggc acc gag gac cag etc aat gac agg tta cga 102 
Val His Pro Leu Pro Gly Thr Glu Asp Gin Leu Asn Asp Arg Leu Arg 
10 15 20 

gaa gtt tct gag aag ctg aac aaa tat aat tta aac age cac ccc cct 150 
Glu Val Ser Glu Lys Leu Asn Lys Tyr Asn Leu Asn Ser His Pro Pro 
25 30 35 

ttg. aat gta ttg gaa cag get act att aaa cag tgt .gtg gtg gga cca 198 
Leu Asn Val Leu Glu Gin Ala Thr He Lys Gin Cys Val Val Gly Pro 
40 45 50 55 

aat cat get gee ttt ctt ctt gag gat ggt aga gtt tgc agg att ggt 246 
Asn His Ala Ala Phe Leu Leu Glu Asp Gly Arg Val Cys Arg He Gly 
60 65 70 

ttt tea gta cag cca gac aga ttg gaa ttg ggt aaa cct gat aat aat 294 
Phe Ser Val Gin Pro Asp Arg Leu Glu Leu Gly Lys Pro Asp Asn Asn 
75 80 85 

gat ggg tea aag ttg aac age aac teg ggg gca ggg agg acg tea agg 342 
Asp Gly Ser Lys Leu Asn Ser Asn Ser Gly Ala Gly Arg Thr Ser Arg 
90 95 100 

cct ggt agg aca age gac tct cca tgg ttt etc tea ggt tct gag act 390 
Pro Gly Arg Thr Ser Asp Ser Pro Trp Phe Leu Ser Gly Ser Glu Thr 
105 110 115 

eta ggc agg ctg gca ggc aac acc tta gga age cgc tgg agt tct gga 438 
Leu Gly Arg Leu Ala Gly Asn Thr Leu Gly Ser Arg Trp Ser Ser Gly 
120 125 130 135 

gtg ggt gga agt ggt gga gga tec tct ggt agg tea tea get gga get 486 
Val Gly Gly Ser Gly Gly' Gly Ser Ser Gly Arg Ser Ser Ala Gly Ala 
140 145 150 

cga gat tec cgc egg cag act cga gtt att egg aca gga egg gat cga 534 
Arg Asp Ser Arg Arg Gin Thr Arg Val He Arg Thr Gly Arg Asp Arg 
155 160 165 

ggg tct ggg ctt ttg ggc agt cag ccc cag cca gtt att cca gca tct 582 
Gly Ser Gly Leu Leu Gly Ser Gin Pro Gin Pro Val He Pro Ala Ser 
170 175 180 

gtc att cca gag gag ctg att tea cag gee caa gtt gtt tta caa ggc 630 
Val He Pro Glu Glu Leu He Ser Gin Ala Gin Val Val Leu Gin Gly 
185 190 195 

aaa tec aga agt gtc att att cga gaa ctt cag aga aca aat ctt gat 678 
Lys Ser Arg Ser Val He He Arg Glu Leu Gin Arg Thr Asn Leu Asp 
200 205 210 215 
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gtg aac ctt get gta aat aat tta ctt age egg- gat gat gaa gat gga 726 
Val Asn Leu Ala Val Asn Asn Leu Leu Ser Arg Asp Asp Glu Asp Gly 
220 -225 * 230 

gat gat ggg gat gat aca gee age gaa tct' tat ttg cct gga gag gat 774 
Asp Asp Gly Asp Asp Thr Ala Ser Glu Ser Tyr Leu Pro Gly Glu Asp 
235 240 245 

ctt atg tct etc ctt gat gee gac att cat tct gee cac cca agt gtc 822 
Leu Met Ser Leu Leu Asp Ala Asp lie His Ser Ala His Pro Ser Val 
250 255 260 

att att gat gca gat gec atg ttt tct gaa gac att age tat ttt ggt 870 
lie Xle Asp Ala Asp Ala Met Phe Ser Glu Asp lie Ser Tyr Phe Gly 
265 270 275 

tac cct tct ttt cgt cgt tea tea ctt tec agg eta ggc tea tct cga 918 
Tyr Pro Ser Phe Arg Arg Ser Ser Leu Ser Arg Leu Gly Ser Ser Arg 
280 285 290 295 

gtt etc ctt ctt ccc tta gag aga gac tct gag ctg ttg cgt gaa cgt 966 
Val Leu Leu Leu Pro Leu Glu Arg Asp Ser Glu Leu Leu Arg Glu Arg 
300 ~ 305 310 

gaa tec gtt tta cgt tta cgt gaa cga agg tgg ctt gat gga gee tea 1014 
Glu Ser Val Leu Arg Leu Arg Glu Arg Arg Trp Leu Asp Gly Ala Ser 
315 320 325 

ttt gat aat gaa agg ggt tct acc age aag gaa gga gag cca aac ttg 1062 
Phe Asp Asn Glu Arg Gly Ser Thr Ser Lys Glu Gly Glu Pro Asn Leu 
330 335 340 

gat aag aag aat aca cct gtt caa agt cca gta tct eta gga gaa gat 1110 
Asp Lys Lys Asn Thr Pro Val Gin Ser Pro Val Ser Leu Gly Glu Asp 
345 350 355 

ttg cag tgg tgg cct gat aag gat gga aca aaa ttc ate tgt att ggg 1158 
Leu Gin Trp Trp Pro Asp Lys Asp Gly Thr Lys Phe lie Cys He Gly 
360 " 365 ~ 370 375 

get ctg tat tct gaa ctt ctg get gtc age agt aaa gga gaa ctt tat 1206 
Ala Leu Tyr Ser Glu Leu Leu Ala Val Ser Ser Lys Gly Glu Leu Tyr 
380 385 390 

cag tgg aaa tgg agt gaa tct gag cct tac aga aat gee cag aat cct 1254 
Gin Trp Lys Trp Ser Glu Ser Glu Pro Tyr Arg Asn Ala Gin Asn Pro 
395 400 405 

tea tta cat cat cca cga gca aca ttt ttg ggg tta acc aat gaa aag 1302 
Ser Leu His His Pro Arg Ala Thr Phe Leu Gly Leu Thr Asn Glu Lys 
410 415 420 

ata gtc etc ctg tct gca aat age ata aga gca act gta get aca gaa 1350 
He Val Leu Leu Ser Ala Asn Ser lie Arg Ala Thr Val Ala Thr Glu 
425 430 435 

aat aac aag gtt get aca tgg gtg gat gaa act tta agt tct gtg get 1398 
Asn Asn Lys Val Ala Thr Trp Val Asp Glu Thr Leu Ser Ser Val Ala 
440 445 450 455 

tct aaa tta gag cac act get cag act tac tct gaa ctt caa gga gag 1446 
Ser Lys Leu Glu His Thr Ala Gin Thr Tyr Ser Glu Leu Gin Gly Glu 
460 465 470 

egg ata gtt tct tta cat tgc tgt gee ctt tac acc tgc get cag ctg 1494 
Arg He Val Ser Leu His Cys Cys Ala Leu Tyr Thr Cys Ala Gin Leu 
475 480 485 
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gaa aac agt tta tat tgg "tgg ggt gta gtt cct ttt agt caa agg aag 1542 
Glu Asn Ser Leu Tyr Trp Trp Gly Val Val Pro Phe Ser Gin Arg Lys 
490 495 500 

aaa atg tta gag aaa get aga gca aaa .aat aaa aag cct aaa tec agt 1590 
Lys Met Leu Glu Lys Ala Arg Ala Lys Asn Lys Lys Pro Lys Ser Ser 
505 510 - 515 

get ggt att tct tea atg ccg aac ate act gtt ggt ace cag gta tgc 1638 
Ala Gly lie Ser Ser Met Pro Asn lie Thr Val Gly Thr Gin Val Cys 
520 525 530 535 

ttg aga aat aat cct ctt tat cat get gga gca gtt gca ttt tea att 1686 
Leu Arg Asn Asn Pro Leu Tyr His Ala Gly Ala Val Ala Phe Ser lie 
540 545 550 

agt get ggg att cct aaa gtt ggt gtc tta atg gag tea gtt tgg aat 1734 
Ser Ala Gly He Pro Lys Val Gly Val Leu Met Glu Ser Val Trp Asn 
555 560 565 

atg aat gac age tgt aga ttt caa ctt aga tct cct gaa age ttg aaa 1782 
Met Asn Asp Ser Cys Arg Phe Gin Leu Arg Ser Pro Glu Ser Leu Lys 
570 575 580 

aac atg gaa aaa get age aaa act act gaa get aag cct gaa agt aag 1830 
Asn Met Glu Lys Ala Ser Lys Thr Thr Glu Ala Lys Pro Glu Ser Lys 
585 590 595 

cag gag cca gtg aaa aca gaa atg ggt cct. cca cca tct cca gca tec 1878 
Gin Glu Pro Val Lys Thr Glu Met Gly Pro Pro Pro Ser Pro Ala Ser 
600 605 610 615 

acg tgt agt gat gca tec tea att gee age agt gca tea atg cca tac 1926 
Thr Cys Ser Asp Ala Ser Ser He Ala Ser Ser Ala Ser Met Pro Tyr 
620 625 630 

aaa cga cga egg tea acc cct gca cca aaa gaa gag gaa aag gtg aat 1974 
Lys Arg Arg Arg Ser Thr Pro Ala Pro Lys Glu Glu Glu Lys Val Asn 
635 640 645 

gaa gag cag tgg tct ctt egg gaa gtg gtt ttt gtg gaa gat gtc aag 2022 1 
Glu Glu Gin Trp Ser Leu Arg Glu Val Val Phe Val Glu Asp Val Lys 
650 655 660 

aat gtt cct gtt ggc aag gtg eta aaa gta gat ggt gee tat gtt get 2070 
Asn Val Pro Val Gly Lys Val Leu Lys Val Asp Gly Ala Tyr Val Ala 
665 670 675 

gta aaa ttt cca gga acc tec agt aat .act aac tgt cag aac age tct 2118 
Val Lys Phe Pro Gly Thr Ser Ser Asn Thr Asn Cys Gin Asn Ser Ser 
680 685 690 695 

ggt cca gat get gac cct tct tct etc ctg cag gat tgt agg tta ctt 2166 
Gly Pro Asp Ala Asp Pro Ser Ser Leu Leu Gin Asp Cys Arg Leu Leu 
700 705 710 

aga att gat gaa ttg cag gtt gtc aaa act ggt gga aca ccg aag gtt 2214 
Arg He Asp Glu Leu Gin Val Val Lys Thr Gly Gly Thr Pro Lys Val 
715 720 725 

ccc gac tgt ttc caa agg act cct aaa aag ctt tgt ata cct gaa aaa 2262 
Pro Asp Cys Phe Gin Arg Thr Pro Lys Lys Leu Cys He Pro Glu Lys 
730 735 740 

aca gaa ata tta gca gtg aat gta gat tec aaa ggt gtt cat get gtt 2310 
Thr Glu He Leu Ala Val Asn Val Asp Ser Lys Gly Val His Ala Val 
745 750 755 
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ctg aag act gga aat tgg gtg cga tac tgt ate ttt gat ctt get aca 2358 
Leu Lys Thr Gly Asn Trp Val Arg Tyr Cys lie Phe Asp Leu Ala Thr 
760 765 770 775 

gga aaa gca gaa cag gaa aat aat ttt cct aca age age att get ttc 2406 
Gly Lys Ala Glu Gin Glu Asn Asn Phe Pro Thr Ser Ser lie Ala Phe 
780 785 790 

ctt ggt cag aat gag agg aat gta gec att ttc act get gga cag gaa 2454 
Leu Gly Gin Asn Glu Arg Asn Val Ala He Phe Thr Ala Gly Gin Glu 
795 800 805 

tct ccc att att ctt cga gat gga aat ggt acc ate tac cca atg gee 2502 
Ser Pro He He Leu Arg Asp Gly Asn Gly Thr He Tyr Pro Met Ala 
810 815 820 

aaa gat tgc atg gga gga ata agg gat ccc gat tgg ctg gat ctt cca 2550 
Lys Asp Cys Met Gly Gly He Arg Asp Pro Asp Trp Leu Asp Leu Pro 
825 830 835 

- cct att agt agt ctt gga atg ggt gtg cat tct tta ata aat ctt cct 2598 
Pro He Ser Ser Leu Gly Met Gly Val His Ser Leu He Asn Leu Pro 
840 845 850 855 

gec aat tea aca ate aaa aag aaa get get gtt ate ate atg get gta 2646 
Ala Asn Ser Thr He Lys Lys Lys Ala Ala Val He He Met Ala Val 
860 865 870 

gag aaa caa acc tta atg caa cac att ctg cgc tgt gac tat gag gec 2694 
Glu Lys Gin Thr Leu Met Gin His He Leu Arg Cys Asp Tyr Glu Ala 
875 880 885 

tgt cga caa tat eta atg aat ctt gag caa gcg gtt gtt tta gag cag 2742 
Cys Arg Gin Tyr Leu Met Asn Leu Glu Gin Ala Val Val Leu Glu Gin 
890 895 900 

aat eta cag atg ctg cag aca ttc ate age cac aga tgt gat gga aat 2790 
Asn Leu Gin Met Leu Gin Thr Phe He Ser His Arg Cys Asp Gly Asn 
905 910 915 

cga aat att ttg cat get tgt gta tea gtt tgc ttt cca acc age aat 2838 
Arg Asn He Leu His Ala Cys Val Ser Val Cys Phe Pro Thr Ser Asn 
920 925 930 935 

aaa gaa act aaa gaa gaa gag gaa gcg gag cgt tct gaa aga aat aca 2886 
Lys Glu Thr Lys Glu Glu Glu Glu Ala Glu Arg Ser Glu Arg Asn Thr 
940 '945 950 

ttt gca gaa agg ctt tct get gtt gag gec att gca aat gca ata tea 2934 
Phe Ala Glu Arg Leu Ser Ala Val Glu Ala He Ala Asn Ala He Ser 
955 960 965 

gtt gtt tea agt aat ggc cca ggt aat egg get gga tea tea agt age 2982 
Val Val Ser Ser Asn Gly Pro Gly Asn Arg Ala Gly Ser Ser Ser Ser 
970 975 980 

cga agt ttg aga tta egg gaa atg atg aga cgt teg ttg aga gca get 3030 
Arg Ser Leu Arg Leu Arg Glu Met Met Arg Arg Ser Leu Arg Ala Ala 
985 990 995 

ggt ttg ggt aga cat gaa get gga get tea tec agt gac cac cag 3075 
Gly Leu Gly Arg His. Glu Ala Gly Ala Ser Ser Ser Asp His Gin 
1000 1005 1010 

gat cca gtt tea ccc ccc ata get ccc cct agt tgg gtt cct gac 3120 
Asp Pro Val Ser Pro Pro He Ala Pro Pro Ser Trp Val Pro Asp 
1015 1020 1025 
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cct cct gcg atg gat cct gat ggt gac att gat ttt ate ctg gec 3165 
Pro Pro Ala Met Asp Pro Asp Gly Asp He Asp Phe He Leu Ala 
1030 1035 ' 1040 

ccc get gtg gga tct ctt ace aca gca gca acc ggt act ggt caa 3210 
Pro Ala Val Gly Ser Leu Thr Thr Ala Ala Thr Gly Thr Gly Gin 
1045 1050 1055 

gga cca age acc tec act att cca ggt cct tec aca gag cca tct 3255 
Gly Pro Ser Thr Ser Thr He Pro Gly Pro Ser Thr Glu Pro Ser 
1060 1065 . 1070 

gta gta gaa tec aag gat cga aag gcg aat get cat ttt ata ttg 3300 
Val Val Glu Ser Lys Asp Arg Lys Ala Asn Ala His Phe He Leu 
1075 1080 1085 

aaa ttg tta tgt gac agt gtg gtt etc cag ccc tat eta cga gaa 3345 
Lys Leu Leu Cys Asp Ser Val Val Leu Gin Pro Tyr Leu Arg Glu 
1090 1095 1100 

ctt ctt tct gee aag gat gca aga ggg atg acc cca ttt atg tea 3390 
Leu Leu Ser Ala Lys Asp Ala Arg Gly Met Thr Pro Phe Met Ser 
1105 1110 H15 

get gta agt ggc cga get tat cct get gca att acc ate tta gaa 3435 
Ala Val Ser Gly Arg Ala Tyr Pro Ala Ala He ' Thr He Leu Glu 
1120 1125 1130 

act get cag aaa att gca aaa get gaa ata tec tea agt gaa aaa 3480 
Thr Ala Gin Lys He Ala Lys Ala Glu He Ser Ser Ser Glu Lys 
1135 1140 1145 

gag gaa gat gta ttc atg gga atg gtt tgc cca tea ggt acc aac 3525 
Glu Glu Asp Val Phe Met Gly Met Val Cys Pro Ser Gly Thr Asn 
1150 1155 1160 

cct gat gac tct cct tta tat gtt tta tgt tgt aat gac act tgc 3570 
Pro Asp Asp Ser Pro Leu Tyr Val Leu Cys Cys Asn Asp Thr Cys 
1165 1170 1175 

agt ttt aca tgg act gga gca gag cac att aac cag gat att ttt 3615 
Ser Phe Thr Trp Thr Gly ' Ala Glu His He Asn Gin Asp He Phe 
1180 H85 1190 

gag tgt cga act tgt ggc ttg ctg gag tea ctg tgt tgt tgt acg 3660 
Glu Cys Arg Thr Cys Gly Leu Leu Glu Ser Leu Cys Cys Cys Thr 
1195 1200 1205 

gaa tgt gca agg gtt tgt cat aaa ggt cat gat tgc aaa etc aaa 3705 
Glu Cys Ala Arg Val Cys His Lys Gly His Asp Cys Lys Leu Lys 
1210 1215 1220 

egg aca tea cca aca gee tac tgt gat tgt tgg gag aaa tgt aaa 3750 
Arg Thr Ser Pro Thr Ala Tyr Cys Asp Cys Trp Glu Lys Cys Lys 
1225 1230 1235 

tgt aaa act ctt att get gga cag aaa tct get cgt ctt gat eta 3795 
Cys Lys Thr Leu He Ala Gly Gin Lys Ser Ala Arg Leu Asp Leu 
1240 1245 1250 

ctt tat cgc ctg etc act get act aat ctg gtt act ctg cca aac 3840 
Leu Tyr Arg Leu Leu Thr Ala Thr Asn Leu Val Thr Leu Pro Asn 
1255 1260 1265 

age agg gga gag cac etc tta eta ttc tta gta cag aca gtc gca 3885 
Ser Arg Gly Glu His Leu Leu Leu Phe Leu Val Gin Thr Val Ala 
1270 1275 1280 
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agg cag acg gtg gag cat tgt caa tac agg cca cct cga ate agg 
Arg Gin Thr Val Glu His Cys Gin Tyr Arg Pro Pro Arg He Arg 
1285 1290 1295 



3930 



gaa gat cgt aac cga aaa aca gec agt cct gaa gat tea gat atg 3975 

Glu Asp Arg Asn Arg Lys Thr Ala Ser Pro Glu Asp Ser Asp Met 
1300 1305 1310 

cca gat cat gat tta gag cct cca aga ttt gec cag ctt gca ttg 4020 

Pro Asp His Asp Leu Glu Pro Pro Arg Phe Ala Gin Leu Ala Leu 
1315 1320 1325 



gag cgt gtt eta cag gac 
Glu Arg Val Leu Gin Asp 
1330 1335 



tgg aat gec ttg aaa 
Trp Asn Ala Leu Lys 
■ 1340 



tct atg att atg 
Ser Met He Met 



4065 



ttt ggg teg cag gag aat aaa gac cct ctt agt gee age agt aga 
Phe Gly Ser Gin Glu Asn Lys Asp Pro Leu Ser Ala Ser Ser Arg 
1345 1350 1355 



4110 



ata ggc cat ctt ttg cca gaa gag caa gta tac 
He Gly His Leu Leu. Pro Glu Glu Gin Val Tyr 
1360 1365 1370 



etc aat cag caa 
Leu Asn Gin Gin 



4155 



agt ggc aca att egg ctg gac tgt ttc act cat tgc ctt ata gtt 
Ser Gly Thr He Arg Leu Asp Cys Phe Thr His Cys Leu He Val 
1375 1380 1385 



4200 



aag tgt aca gca gat att ttg ctt tta gat act eta eta ggt aca 
Lys Cys Thr Ala Asp He Leu Leu Leu Asp Thr Leu Leu Gly Thr 
1390 1395 1400 



4245 



eta gtg aaa gaa etc caa aac aaa tat aca cct gga cgt aga gaa 
Leu Val Lys Glu Leu Gin Asn Lys Tyr Thr Pro Gly Arg Arg Glu 
1405 1410 1415 

gaa get att get gtg aca atg agg ttt eta cgt tea gtg gca aga 
Glu Ala He Ala Val Thr Met Arg Phe Leu Arg Ser Val Ala Arg 
1420 1425 ~ 1430 

gtt ttt gtt att ctg agt gtg gaa atg get tea tec aaa aag aaa 
Val Phe Val He Leu Ser Val Glu Met Ala Ser Ser Lys Lys Lys 
1435 1440 1445 



4290 



4335 



4380 



aac aac ttt att cca cag cca att gga aaa tgc aag cgt gta ttc 
Asn Asn Phe He Pro Gin Pro He Gly Lys Cys Lys Arg Val Phe 
1450 1455 1460 



4425 



caa gca ttg eta cct tac get gtg gaa gaa ttg 
Gin Ala Leu Leu Pro Tyr Ala Val Glu Glu Leu 
1465 1470 1475 



tgc aac gta gca 
Cys Asn Val Ala 



4470 



gag tea ctg att gtt cct 
Glu Ser Leu He Val Pro 
1480 1485 



gtc aga atg ggg att get cgt cca act 
Val Arg Met Gly He Ala Arg Pro Thr 
1490 



4515 



gca cca ttt ace ctg get agt act age ata gat 
Ala Pro Phe Thr Leu Ala Ser Thr Ser He Asp 
1495 1500 1505 



gec atg cag ggc 
Ala Met Gin Gly 



4560 



agt gaa gaa tta ttt tea 
Ser Glu Glu Leu Phe Ser 
1510 1515 



gtg gaa cca eta cca 
Val Glu Pro- Leu Pro 
1520 



cca cga cca tea 
Pro Arg Pro Ser 



4605 



tct gat cag tct age age tec agt cag tct cag tea tec tac ate 
Ser Asp Gin Ser Ser Ser Ser Ser Gin Ser Gin Ser Ser Tyr He 
1525 1530 1535 



4650 
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ate agg aat cca cag cag agg cgc ate age cag tea cag ccc gtt 
He Arg Asn Pro Gin Gin Arg Arg He Ser Gin Ser Gin Pro Val 
1540 1545 1550 



4695 



egg ggc aga gat gaa gaa cag gat gat att gtt 
Arg Gly Arg Asp Glu Glu Gin Asp Asp He Val 
1555 1560 1565 



tea gca gat gtg 
Ser Ala Asp Val 



4740 



gaa gag gtt gag gtg gtg gag ggt gtg get gga gaa gag gat cat 
Glu Glu Val Glu Val Val Glu Gly Val Ala Gly Glu Glu Asp His 
1570 1575 1580 

cat gat gaa cag gaa gaa cac ggg gaa gaa aat get gag gca gag 
His Asp Glu Gin Glu Glu His Gly Glu Glu Asn Ala Glu Ala Glu 
1585 1590 1595 



4785 



4830 



gga caa cat gat gag cat gat gaa gac ggg agt gat atg gag ctg 4875 
Gly Gin His Asp Glu His Asp Glu Asp Gly Ser Asp Met Glu Leu 
1600 1605 1610 

gac ttg tta gca gca get gaa aca gaa agt gat agt gaa agt aac 4920 
Asp Leu Leu Ala Ala Ala Glu Thr Glu Ser Asp Ser Glu Ser Asn 
1615 1620 1625 



cac age aac caa gat aat get agt ggg cgc aga age gtt gtc act 
His Ser Asn Gin Asp Asn Ala Ser Gly Arg Arg Ser Val Val Thr 
1630 1635 1640 



4965 



gca gca act get ggt tea gaa gca gga gca age 
Ala Ala Thr Ala Gly Ser Glu Ala Gly Ala Ser 
1645 1650 1655 



agt gtt cct gee 
Ser Val Pro Ala 



ttc ttt tct gaa gat gat tct caa teg aat gac tea agt gat tct 
Phe Phe Ser Glu Asp Asp Ser Gin Ser Asn Asp Ser Ser Asp Ser 
1660 1665 1670 



5010 



5055 



gat age agt agt agt cag agt gac gac ata gaa 
Asp Ser Ser Ser Ser Gin Ser Asp Asp He Glu 
1675 1680 * 1685 



cag gag ace ttt 
Gin Glu Thr Phe 



atg ctt gat gag cca tta gaa aga acc aca aat age tec cat gee 
Met Leu Asp Glu Pro Leu Glu Arg Thr Thr Asn Ser Ser His Ala 
1690 1695 1700 



aat ggt get gee caa get ccc cgt tea atg cag 
Asn Gly Ala Ala Gin Ala Pro Arg Ser Met Gin 
1705 1710 1715 



tgg get gtc cgc 
Trp Ala Val Arg 



aac acc cag cat cag cga gca gee agt aca gee cct tec agt aca 
Asn Thr Gin His Gin Arg Ala Ala Ser Thr Ala Pro Ser Ser Thr 
1720 1725 1730 



5100 



5145 



5190 



5235 



tct aca cca gca gca agt tea gcg' ggt ttg att 
Ser Thr Pro Ala Ala Ser Ser Ala Gly Leu He 
1735 1740 1745 



tat att gat cct 
Tyr He Asp Pro 



tea aac tta cgc egg agt ggt acc ate agt aca agt get gca get 
Ser Asn Leu Arg Arg Ser Gly Thr He Ser Thr Ser Ala Ala Ala 
1750 1755 1760 



5280 



5325 



gca gca get get ttg gaa get age aac gee age 
Ala Ala Ala Ala Leu Glu Ala Ser Asn Ala Ser 
1765 1770 1775 



agt tac eta aca 
Ser Tyr Leu Thr 



5370 



tct gca age agt tta gee agg get tac age att gtc att aga caa 
Ser Ala Ser Ser Leu Ala Arg Ala Tyr Ser lie Val He Arg Gin 
1780 1785 1790 



5415 
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ate teg gac ttg atg ggc ctt att cct aag tat aat cac eta gta 5460 
He Ser Asp Leu Met Gly Leu He Pro Lys Tyr Asn His Leu Val 
1795 1800 1805 

tac tct cag att cca gca get gtg aaa ttg act tac caa gat gca 5505 
Tyr Ser Gin He Pro Ala Ala Val Lys Leu Thr Tyr Gin Asp Ala 
1810 1815 1820 



gta aac tta cag aac tat gta gaa gaa aag ctt att ccc act tgg 
Val Asn Leu Gin Asn Tyr Val Glu Glu Lys Leu He Pro Thr Trp 
1825 1830 1835 

aac tgg atg gtc agt att atg gat tct act gaa get caa tta cgt 
Asn Trp Met Val Ser He Met Asp Ser Thr Glu Ala Gin Leu Arg 
1840 1845 1850 

tat ggt tct gca tta gca tct get ggt gat cct gga cat cca aat 
Tyr Gly Ser Ala Leu Ala Ser Ala Gly Asp Pro Gly His Pro Asn 
1855 i860 1865 

cat cct ctt cac get tct cag aat tea gcg aga ■ aga gag agg atg 
His Pro Leu His Ala Ser Gin Asn Ser Ala Arg Arg Glu Arg Met 
1870 1875 1880 



5550 



5595 



5640 



5685 



act gcg cga gaa gaa get age tta cga aca ctt gaa ggc aga cga 
Thr Ala Arg Glu Glu Ala Ser Leu Arg Thr Leu Glu Gly Arg Arg 
1885 1890 1895 



5730 



cgt gec acc ttg ctt age gee cgt caa gga atg 
Arg Ala Thr Leu Leu Ser Ala Arg Gin Gly Met 
1900 1905 1910 



atg tct gca cga 
Met Ser Ala Arg 



gga gac ttc eta aat tat get ctg tct eta atg egg tct cat aat 
Gly Asp Phe Leu Asn Tyr Ala Leu Ser Leu Met Arg Ser His Asn 
1915 1920 1925 

gat gag cat tct gat gtt ctt cca gtt ttg gat gtt tgc tea ttg 
Asp Glu His Ser Asp Val Leu Pro Val Leu Asp Val Cys Ser Leu 
1930 1935 1940 



5775 



5820 



5865 



aag cat gtg gca tat gtt ttt caa gca ctt ata 
Lys His Val Ala Tyr Val Phe Gin Ala Leu lie 
1945 1950 1955 

gca atg aat cag cag aca aca ttg gat aca cct 
Ala Met Asn Gin Gin Thr Thr Leu Asp Thr Pro 
I960 1965 . 1970 



tac tgg att aag 
Tyr Trp He Lys 



caa eta gaa cgc 
Gin Leu Glu Arg 



5910 



5955 



aaa agg acg cga gaa etc ttg gaa ctg ggt att 
Lys Arg Thr Arg Glu Leu Leu Glu Leu Gly He 
1975 1980 1985 



gat aat gaa gat 
Asp Asn Glu Asp 



6000 



tea gaa cat gaa aat gat gat gac acc aat caa agt get act ttg 
Ser Glu His Glu Asn Asp Asp Asp Thr Asn Gin Ser Ala Thr Leu 
1990 1995 2000 



6045 



aat gat aag gat gat gac tct ctt cct gca gaa act ggc caa aac 
Asn Asp Lys Asp Asp Asp Ser Leu Pro Ala Glu Thr Gly Gin Asn 
2005 2010 2015 



6090 



cat cca ttt ttc cga cgt 
His Pro Phe Phe Arg Arg 
2020 2025 



tea gac tec atg aca ttc ctt ggg tgt 
Ser Asp Ser Met Thr Phe Leu. Gly Cys 
2030 



6135 



ata ccc cca aat cca ttt gaa gtg cct ctg get gaa gee ate ccc 
He Pro Pro Asn Pro Phe Glu Val Pro Leu Ala Glu Ala He Pro 
2035 2040 2045 



6180 
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ttg get gat cag cca cat ctg ttg cag cca aat get aga aag gag 6225 
Leu Ala Asp Gin Pro His Leu Leu Gin Pro Asn Ala Arg Lys Glu 
2050 2055 2060 

gat ctt ttt ggc cgt cca agt cag ggt ctt tat tct tea tct gec 6270 
Asp Leu Phe Gly Arg Pro Ser Gin Gly Leu Tyr Ser Ser Ser Ala 
2065 2070 2075 

agt agt ggg aaa tgt tta atg gag gtt aca gtg gat aga aac tgc 6315 
. Ser Ser Gly Lys Cys Leu Met Glu Val Thr Val Asp Arg Asn Cys 
2080 2085 2090 

eta gag gtt ctt cca aca aaa atg tct tat get gee aat ctg aaa 6360 
Leu Glu Val Leu Pro Thr Lys Met Ser Tyr Ala Ala Asn Leu Lys 
2095 2100 2105 

aat gta atg aac atg caa aac egg caa aaa aaa gaa ggg gaa gaa 6405 
Asn Val Met Asn Met Gin Asn Arg Gin Lys Lys Glu Gly Glu Glu 
2110 2115 2120 

cag ccc gtg ctg cca gaa gaa act gag agt tea aaa cca ggg cca 6450 
Gin Pro Val Leu. Pro Glu Glu- Thr Glu Ser Ser Lys Pro Gly Pro 
2125 2130 2135 

tct get cat gat ctt get gca caa tta aaa agt age tta eta gca 6495 
Ser Ala His Asp Leu Ala Ala Gin Leu Lys Ser Ser Leu Leu Ala 
2140 2145 2150 

gaa ata gga ctt act gaa agt gaa ggg cca cct etc aca tct ttc 6540 
Glu lie Gly Leu Thr Glu Ser Glu Gly Pro Pro Leu Thr Ser Phe 
2155 2160 2165 

agg cca cag tgt age ttt atg gga atg gtt att tec cat gat atg 6585 
Arg Pro Gin Cys Ser Phe Met Gly Met Val lie Ser His Asp Met 
2170 2175 2180 

ctg eta gga cgt tgg cgc . ctt tct tta gaa ctg ttc ggc agg gta 6630 
Leu Leu Gly Arg Trp Arg Leu Ser Leu Glu Leu Phe Gly Arg Val 
2185 2190 2195 

ttc atg gaa gat gtt gga gca gaa cct gga tea ate eta act gaa 6675 
Phe Met Glu Asp Val Gly Ala Glu Pro Gly Ser He Leu Thr Glu 
2200 2205 2210 

ttg ggt ggt ttt gag gta aaa gaa tea aaa ttc cgc aga gaa atg 6720 
Leu Gly Gly Phe Glu Val Lys Glu Ser Lys Phe Arg Arg Glu Met 
2215 2220 2225 

gaa aaa ctg aga aac cag cag tea aga gat ttg tea eta gag gtt 6765 
Glu Lys Leu Arg Asn Gin Gin Ser Arg Asp Leu Ser Leu Glu Val 
2230 2235 . 2240 

gat egg gat cga gat ctt etc att cag cag act atg agg cag ctt 6810 
Asp Arg Asp Arg Asp Leu Leu He Gin Gin Thr Met Arg Gin Leu 
2245 2250 2255 

aac aat cac ttt ggt cga aga tgt get act aca cca atg get gta 6855 
Asn Asn His Phe Gly Arg Arg Cys Ala Thr Thr Pro Met Ala Val 
2260 2265 2270 

cac aga gta aaa gtc aca ttt aag gat gag cca gga gag ggc agt 6900 
His Arg Val Lys Val Thr Phe Lys Asp Glu Pro Gly Glu Gly Ser 
2275 2280 2285 



ggt gta gca cga agt ttt tat aca gee att gca caa gca ttt tta 
Gly Val Ala Arg Ser Phe Tyr Thr Ala He Ala Gin Ala Phe Leu 
2290 2295 2300 



6945 



WO 2004/022778 

169 



tea aat gaa aaa ttg cca aat eta gag tgt ate caa aat gee aac 
Ser Asn Glu Lys Leu Pro Asn Leu Glu Cys He Gin Asn Ala Asn 
2305 2310 ~ 2315 

aaa ggc acc cac aca agt tta atg cag aga tta agg aac cga gga 
Lys Gly Thr His Thr Ser Leu Met Gin Arg Leu Arg Asn Arg Gly 
2320 2325 2330 

gag aga gac egg gaa agg gag aga gaa agg gaa atg agg agg agt 
Glu Arg Asp Arg Glu Arg Glu Arg Glu Arg Glu Met Arg Arg Ser 
2335 2340 2345 

agt ggt ttg cga gca ggt tct egg agg gac egg gat aga gac ttt 
Ser Gly Leu Arg Ala Gly Ser Arg Arg Asp Arg Asp Arg Asp Phe 
2350 2355 ~ 2360 

aga aga cag ctt tec ate gac act agg ccc ttt aga cca gee tct 
Arg Arg Gin Leu Ser He Asp Thr Arg Pro Phe Arg Pro Ala Ser 
2365 2370 2375 

gaa ggg aat cct age gat gat cct gag cct ttg cca gca cat egg 
Glu Gly Asn Pro Ser Asp Asp Pro Glu Pro Leu Pro Ala His Arg 
2380 2385 2390 

cag gca ctt gga gag agg ctt. tat cct cgt gta caa gca atg caa 
Gin Ala Leu Gly Glu Arg Leu Tyr Pro Arg Val Gin Ala Met Gin 
2395 2400 2405 

cca gca ttt gca agt aaa ate act ggc atg ttg ttg gaa tta tec 
Pro Ala Phe Ala Ser Lys He Thr Gly Met Leu Leu Glu Leu Ser 
2410 2415 2420 

cca get cag ctg ctt etc ctt eta gca agt gag gat tct ctg aga 
Pro Ala Gin Leu Leu Leu Leu Leu Ala Ser Glu Asp Ser Leu Arg 
2425 2430 2435 



gca aga gtg gat gag gee atg gaa etc att att gca cat gga egg 
Ala Arg Val Asp Glu Ala Met Glu Leu He He Ala His Gly Arg 
2440 2445 2450 

gaa aat gga get gat agt ate ctg gat ctt gga tta gta gac tec 
Glu Asn Gly Ala Asp Ser He Leu Asp Leu Gly Leu Val Asp Ser 
2455 2460 2465 

tea gaa aag gta cag cag gaa aac cga aag cgc cat ggc tct agt 
Ser Glu Lys Val Gin Gin Glu Asn Arg Lys Arg His Gly Ser Ser 
2470 2475 2480 

cga agt gta gta gat atg gat tta gat gat aca gat gat ggt gat 
Arg Ser Val Val Asp Met Asp Leu' Asp Asp Thr Asp Asp Gly Asp 
2485 2490 2495 

gac aat gee cct ttg ttt tac caa cct ggg aaa aga gga ttt tat 
Asp Asn Ala Pro Leu Phe Tyr Gin Pro Gly Lys Arg Gly Phe Tyr 
2500 2505 2510 

act cca agg cct ggc aag aac aca gaa gca agg ttg aat tgt ttc 
Thr Pro Arg Pro Gly Lys Asn Thr Glu Ala Arg Leu Asn Cys Phe 
2515 2520 2525 

aga aac att ggc agg att ctt gga eta tgt ctg tta cag aat gaa 
Arg Asn He Gly Arg He Leu Gly Leu Cys Leu Leu Gin Asn Glu 
2530 2535 2540 

etc tgt cct ate aca ttg aat aga cat gta att aaa gta ttg ctt 
Leu Cys Pro He Thr Leu Asn Arg Hi's Val He Lys Val Leu Leu 
2545 2550 2555 
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ggt aga aaa gtc aat tgg cat gat ttt get ttt ttt gat cct gta 
Gly Arg Lys Val Asn Trp His Asp Phe Ala Phe Phe Asp Pro Val 
2560 2565 2570 

atg tat gag agt ttg egg caa eta ate etc gcg tct cag agt tea 
Met Tyr Glu Ser Leu Arg Gin Leu He Leu Ala Ser Gin Ser Ser 
2575 2580 . 2585 

gat get gat get gtt ttc tea gca atg gat ttg gca ttt gca att 
Asp Ala Asp Ala Val Phe Ser Ala Met Asp Leu Ala Phe Ala He 
2590 2595 2600 

gac ctg tgt aaa gaa gaa ggt gga gga cag gtt gaa etc att cct 
Asp Leu Cys Lys Glu Glu Gly Gly Gly Gin Val Glu Leu He Pro 
2605 2610 2615 

aat ggt gta aat ata cca gtc act cca cag aat gta tat gag tat 
Asn Gly Val Asn He Pro Val Thr Pro Gin Asn Val Tyr Glu Tyr 
2620 2625 2630 

gtg egg aaa tac gca gaa cac aga atg ttg gta gtt gca gaa cag 
Val Arg Lys Tyr Ala Glu His Arg Met Leu Val Val Ala Glu Gin 
2635 2640 2645 

ccc tta cat gca atg agg aaa ggt eta eta gat gtg ctt cca aaa 
Pro Leu His Ala Met Arg Lys Gly Leu Leu Asp Val Leu Pro Lys 
2650 2655 2660 



aat tea tta gaa gat tta acg gca gaa gat ttt 
Asn Ser Leu Glu Asp Leu Thr Ala Glu Asp Phe 
2665 2670 ' 2675 

aat ggc tgc ggt gaa gtc aat gtg caa atg ctg 
Asn Gly Cys Gly Glu Val Asn Val Gin Met Leu 
2680 2685 2690 



agg ctt ttg gta 
Arg Leu Leu Val 



ate agt ttt acc 
He Ser Phe Thr 



tct ttc aat gat gaa tea 
Ser Phe Asn Asp Glu Ser 
2695 2700 

ttc aag cgt tgg ttc tgg 
Phe Lys Arg Trp Phe Trp 
2710 2715 



gga gaa aat get gag aag ctt ctg cag 
Gly Glu Asn Ala Glu Lys Leu Leu Gin 
2705 



tea ata gta gag aag 
Ser He Val Glu Lys 
2720 



gaa cga caa gat ctt gtt tac ttt tgg aca tea 
Glu Arg Gin Asp Leu Val Tyr Phe Trp Thr Ser 
2725 2730 2735 



cca gee agt gaa gaa gga 
Pro Ala Ser Glu Glu Gly 
2740 2745 



ttc cag cct atg ccc 
Phe Gin Pro Met Pro 
2750 



atg age atg aca 
Met Ser Met Thr 



age cca tea ctg 
Ser Pro Ser Leu 



tea ate aca ata 
Ser lie Thr He 



aga cca cca gat gac caa cat ctt cct act gca aat act tgc att 
Arg Pro Pro Asp Asp Gin His Leu Pro Thr Ala Asn Thr Cys He 
2755 2760 2765 



tct cga ctt tac gtc cca 
Ser Arg Leu Tyr Val Pro 
2770 2775 



etc tat tec tct aaa cag att etc aaa 
Leu Tyr Ser Ser Lys Gin He Leu Lys 
2780 



cag aaa ttg tta etc gee att aag acc aag aat ttt ggt ttt gtg 
Gin Lys Leu Leu Leu Ala He Lys Thr Lys Asn Phe Gly Phe Val 
2785 2790 2795 

tagagtataa aaagtgtgta ttgctgtgta atattactag caaattttgt agattttttt 
ccatttgtct ataaaagtta aagtttatgg aagttaatgc tgtcataccc ccctggtggt 
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gataaaatgc agacattcct tgctgagttt atagcttaaa ggcctaagga 8610 

catttggcta tattggtttg ctagtcacca acttctgggt ctaaccccag 8670 

cagcagaaca acataattta cactgtgatt tatctttttg ctgaggggaa 8730 

atgttctgaa aattcactgc tgcctttgtg gaaactgttt cagcaaaggt 8790 

agggaatagg gaatttcaaa ataaaaaatt aagtatgt 8838 

<210> 64 

<211> 2799 

<212> PRT 

<213> NM_015902 EDD, hyperplastic discs protein, DD5 

<400> 64 

Met Thr Ser lie His Phe Val Val His Pro Leu Pro Gly Thr Glu Asp 
1 5 10 15 

Gin Leu Asn Asp Arg Leu Arg Glu Val Ser Glu Lys Leu Asn Lys Tyr 
20 25 30 

Asn Leu Asn Ser His Pro Pro Leu Asn Val Leu Glu Gin Ala Thr lie 
35 40 45 

Lys Gin Cys Val Val Gly Pro Asn His Ala Ala Phe Leu Leu Glu Asp 
50 55 60 

Gly Arg Val Cys Arg He Gly Phe Ser Val Gin Pro Asp Arg Leu Glu 
65 70 75 80 

Leu Gly Lys Pro Asp Asn Asn Asp Gly Ser Lys Leu Asn Ser Asn Ser 
85 90 95 

Gly Ala Gly Arg Thr Ser Arg Pro Gly Arg Thr Ser Asp Ser Pro Trp 
100 105 110 

Phe Leu Ser Gly Ser Glu Thr Leu Gly Arg Leu Ala Gly Asn Thr Leu 
115 120 125 

Gly Ser Arg Trp Ser Ser Gly Val Gly Gly Ser Gly Gly Gly Ser Ser 
130 135 140 

Gly Arg Ser Ser Ala Gly Ala Arg Asp Ser Arg Arg Gin Thr Arg Val 
145 150 155 160 



accttaaaga 
gcactagcaa 
ccaaagatga 
aaaaatgtaa 
tcttgtatag 



He Arg Thr Gly Arg Asp Arg Gly Ser Gly Leu Leu Gly Ser Gin Pro 
165 170 175 
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Gin Pro Val He Pro Ala Ser Val He Pro Glu Glu Leu Ile-Ser Gin 
180 185 190 



Ala Gin Val Val Leu Gin Gly Lys Ser Arg Ser Val He He Arg Glu 
195 200 *" 205 



Leu Gin Arg Thr Asn Leu Asp Val Asn Leu Ala Val Asn Asn Leu Leu 
210 .215 220 



Ser Arg Asp Asp Glu Asp Gly Asp Asp Gly Asp Asp Thr Ala Ser Glu 
225 230 235 240 



Ser Tyr Leu Pro Gly Glu Asp Leu Met Ser Leu Leu Asp Ala Asp lie 
245 250 255 



His Ser Ala His Pro Ser Val He He Asp Ala Asp Ala Met Phe Ser 
260 265 270 



Glu Asp He Ser Tyr Phe Gly Tyr Pro Ser Phe Arg Arg Ser Ser Leu 
275 280 285 



Ser Arg Leu Gly Ser Ser Arg Val Leu Leu Leu Pro Leu Glu Arg Asp 
290 295 300 



Ser Glu Leu Leu Arg Glu Arg Glu Ser Val Leu Arg Leu Arg Glu Arg 
305 310 315 ~ ~ 320 



Arg Trp Leu Asp Gly Ala Ser Phe Asp Asn Glu Arg Gly Ser Thr Ser 
325 330 335 



Lys Glu Gly Glu Pro Asn Leu Asp Lys Lys Asn Thr Pro Val Gin Ser 
340 345 350 



Pro Val Ser Leu Gly Glu Asp Leu Gin Trp Trp Pro Asp Lys Asp Gly 
355 360 365 



Thr Lys Phe He Cys He Gly Ala Leu Tyr Ser Glu Leu Leu Ala Val 
370 375 380 



Ser Ser Lys Gly Glu Leu Tyr Gin Trp Lys Trp Ser Glu Ser Glu Pro 
385 390 395 400 



Tyr Arg Asn Ala Gin Asn Pro Ser Leu His His Pro Arg Ala Thr Phe 
405 410 415 



Leu Gly Leu Thr Asn Glu Lys lie Val Leu Leu Ser Ala Asn Ser He 
420 425 430 



Arg Ala Thr Val Ala Thr Glu Asn Asn Lys Val Ala Thr Trp Val Asp 
435 440 445 
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Glu Thr Leu Ser Ser Val Ala Ser Lys Leu Glu His Thr Ala Gin Thr 
450 455 460 



Tyr Ser Glu Leu Gin Gly Glu Arg lie Val Ser Leu His Cys Cys Ala 
465 470 475 480 



Leu Tyr Thr Cys Ala Gin Leu Glu Asn Ser Leu Tyr Trp Trp Gly Val 
485 .490 495 



Val Pro Phe Ser Gin Arg Lys Lys Met Leu Glu Lys Ala Arg Ala Lys 
500 505 510 



Asn Lys Lys Pro Lys Ser Ser Ala Gly lie . Ser Ser Met Pro Asn He 
515 520 525 



Thr Val Gly Thr Gin Val Cys Leu Arg Asn Asn Pro Leu Tyr His Ala 
530 535 540 



Gly Ala Val Ala Phe Ser He Ser Ala Gly He Pro Lys Val Gly Val 
545 550 555 560 



Leu Met Glu Ser Val Trp Asn Met Asn Asp Ser Cys Arg Phe Gin Leu 
565 570 575 



Arg Ser Pro Glu Ser Leu Lys Asn Met Glu Lys Ala Ser Lys Thr Thr 
580 585 590 



Glu Ala Lys Pro Glu Ser Lys Gin Glu Pro Val Lys Thr Glu Met Gly 
595 600 605 



Pro Pro Pro Ser Pro Ala Ser Thr Cys Ser Asp Ala Ser Ser He Ala 
610 615 620 



Ser Ser Ala Ser Met Pro Tyr Lys Arg Arg Arg Ser Thr Pro Ala Pro 
625 630 635 640 



Lys Glu Glu Glu Lys Val Asn Glu Glu Gin Trp Ser Leu Arg Glu Val 
645 650 655 



Val Phe Val Glu Asp Val Lys Asn Val Pro Val Gly Lys Val Leu Lys 
660 665 670 



Val Asp Gly Ala Tyr Val Ala Val Lys Phe Pro Gly Thr Ser Ser Asn 
675 680 685 



Thr Asn Cys Gin Asn Ser Ser Gly Pro Asp Ala Asp Pro Ser Ser Leu 
690 695 700 



Leu Gin Asp Cys Arg Leu Leu Arg He Asp Glu Leu Gin Val Val Lys 
705 710 715 720 
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Thr Gly Gly Thr Pro Lys Val Pro Asp Cys Phe Gin Arg Thr Pro Lys 
725 730 735 



Lys Leu Cys He Pro Glu Lys Thr Glu He Leu Ala Val Asn Val Asp 
740 745 750 



Ser Lys Gly Val His Ala Val Leu Lys Thr Gly Asn Trp Val Arg Tyr 
755 760 765 



Cys He Phe Asp Leu Ala Thr Gly Lys Ala Glu Gin Glu Asn Asn Phe 
770 775 780 



Pro Thr Ser Ser He Ala Phe Leu Gly Gin Asn Glu Arg Asn Val Ala 
785 790 795 800 



He Phe Thr Ala Gly Gin Glu Ser Pro He He Leu Arg Asp Gly Asn 
805 810 815 

Gly Thr He Tyr Pro Met Ala Lys Asp Cys Met Gly Gly He Arg Asp 
820 825 " 830 



Pro Asp Trp Leu Asp Leu Pro Pro He Ser Ser Leu Gly Met Gly Val 
835 840 845 



His Ser Leu He Asn Leu Pro Ala Asn Ser Thr He Lys Lys Lys Ala 
850 855 860 



Ala Val He He Met Ala Val Glu Lys Gin Thr Leu Met Gin His He 
865 870 875 880 



Leu Arg Cys Asp Tyr Glu Ala Cys Arg Gin. Tyr Leu Met Asn Leu Glu 
885 890 895 



Gin Ala Val Val Leu Glu Gin Asn Leu Gin Met Leu Gin Thr Phe He 
900 905 910 



Ser His Arg Cys Asp Gly Asn Arg Asn He Leu His Ala Cys Val Ser 
915 920 925 



Val Cys Phe Pro Thr Ser Asn Lys Glu Thr Lys Glu Glu Glu Glu Ala 
930 935 940 



Glu Arg Ser Glu Arg Asn Thr Phe Ala Glu Arg Leu Ser Ala Val Glu 
* 45 950 955 960 



Ala He Ala Asn Ala He Ser Val Val .Ser Ser Asn Gly Pro Gly Asn 
965 970 975 



Arg Ala Gly Ser Ser Ser Ser Arg Ser Leu Arg Leu Arg Glu Met Met 
980 985 990 
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Arg Arg Ser Leu Arg Ala Ala Gly Leu Gly Arg His Glu Ala Gly Ala 
995 1000 ~ 1005 

Ser Ser Ser Asp His Gin Asp Pro Val Ser Pro Pro He Ala Pro 



Pro Ser Trp Val Pro Asp Pro Pro Ala Met Asp Pro Asp Gly Asp 
1025 1030 1035 



He Asp Phe He Leu Ala Pro Ala Val Gly Ser Leu Thr Thr Ala 
1040 1045 1050 



Ala Thr Gly Thr Gly Gin Gly Pro Ser Thr Ser Thr He Pro Gly 
1055 1060 . 1065 



Pro Ser Thr Glu Pro Ser Val Val Glu Ser Lys Asp Arg Lys Ala 
1070 1075 1080 



Asn Ala His Phe He Leu Lys Leu Leu Cys Asp Ser Val Val Leu 
1085 1090 ~ 1095 



Gin Pro Tyr Leu Arg Glu Leu Leu Ser Ala Lys Asp Ala Arg Gly 
1100 H05 mo 



Met Thr Pro Phe Met Ser Ala Val Ser Gly Arg Ala Tyr Pro Ala 
1115 1120 ( H25 



Ala He Thr He Leu Glu Thr Ala Gin Lys He Ala Lys Ala Glu 
1130 H35 H40 



He Ser Ser Ser Glu Lys Glu Glu Asp Val Phe Met Gly Met Val 
1145 H50 H55 



Cys Pro. Ser Gly Thr Asn Pro Asp Asp Ser Pro Leu Tyr Val Leu 
1160 H65 H70 



Cys Cys Asn Asp Thr Cys Ser Phe Thr ifrp Thr Gly Ala Glu His 
1175 1180 1185 



He Asn Gin Asp He Phe Glu Cys Arg Thr Cys Gly Leu Leu Glu 
1190 1195 1200 



Ser Leu Cys Cys Cys Thr Glu Cys Ala Arg Val Cys His Lys Gly 
1205 1210 1215 



His Asp Cys Lys Leu Lys Arg Thr Ser Pro Thr Ala Tyr Cys Asp 



1010 



1015 



1020 



1220 



1225 



1230 



Cys Trp Glu Lys Cys Lys Cys Lys Thr Leu He Ala Gly Gin Lys 
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1235 1240 1245 

Ser Ala Arg Leu Asp Leu Leu Tyr Arg Leu Leu Thr Ala Thr Asn 
1250 1255 1260 

Leu Val Thr Leu Pro Asn Ser Arg Gly Glu His Leu Leu Leu Phe 
!265 1270 1275 

Leu Val Gin Thr Val Ala Arg Gin Thr Val Glu His Cys Gin Tyr 
1280 1285 1290 

Arg Pro Pro Arg He Arg Glu Asp Arg Asn Arg Lys Thr Ala Ser 
1295 . 1300 1305 

Pro Glu Asp Ser Asp Met Pro Asp His Asp Leu Glu Pro Pro Arg 
!310 1315 1320 

Phe Ala Gin Leu Ala Leu Glu Arg Val Leu Gin Asp Trp Asn Ala 
1325 1330 1335 

Leu Lys Ser Met He Met Phe Gly Ser Gin Glu Asn Lys Asp Pro 
134 0 1345 1350 



Leu Ser Ala Ser Ser Arg He Gly His Leu Leu Pro Glu Glu Gin 
1355 i3 6 o 1365 

Val Tyr Leu Asn Gin Gin Ser Gly Thr He Arg Leu Asp Cys Phe 
1370 1375 1380 

Thr His Cys Leu He Val Lys Cys Thr Ala Asp He Leu Leu Leu 
1385 1390 1395 

Asp Thr Leu Leu Gly Thr Leu Val Lys Glu Leu Gin Asn Lys Tyr 
1400 1405 1410 

Thr Pro Gly Arg Arg Glu Glu Ala He Ala Val Thr Met Arg Phe 
1415 1420 1425 

Leu Arg Ser Val Ala Arg Val Phe Val lie Leu Ser Val Glu Met 
1430 1435 144Q 

Ala Ser Ser Lys Lys Lys Asn Asn Phe He Pro Gin Pro He Gly 
1445 1450 1455 

Lys Cys Lys Arg Val Phe Gin Ala Leu Leu Pro Tyr Ala Val Glu 
1460 1465 1470 



Glu Leu Cys Asn Val Ala Glu Ser Leu He Val Pro Val Arg Met 
1475 1480 1485 
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Gly He Ala Arg Pro Thr Ala Pro Phe Thr Leu Ala Ser Thr Ser 
1490 1495 1500 

He Asp Ala Met Gin Gly Ser Glu Glu Leu Phe Ser Val Glu Pro 
1505 1510 1515 

Leu Pro Pro Arg Pro Ser Ser Asp Gin Ser Ser Ser Ser Ser Gin 
1520 1525 1530 

Ser Gin Ser Ser Tyr He He Arg Asn Pro Gin Gin Arg Arg He 
1535 1540 1545 

Ser Gin Ser Gin Pro Val Arg Gly Arg Asp Glu Glu Gin Asp Asp 
1550 1555 1560 

He Val Ser Ala Asp Val Glu Glu Val Glu Val Val Glu Gly Val 
1565 1570 1575 

Ala Gly Glu Glu Asp His His Asp Glu Gin Glu Glu His Gly Glu 
1580 1585 1590 

Glu Asn Ala Glu Ala Glu Gly Gin His Asp Glu His Asp Glu Asp 
1595 1600 1605 

Gly Ser Asp Met Glu Leu Asp Leu Leu Ala Ala Ala Glu Thr Glu 
1610 1615 1620 

Ser Asp Ser Glu Ser Asn His Ser Asn Gin Asp Asn Ala Ser Gly 
162 5 1630 1635 

Arg Arg Ser Val Val Thr Ala Ala Thr Ala Gly Ser Glu Ala Gly 
1640 1645 1650 

Ala Ser Ser Val Pro Ala Phe Phe Ser Glu Asp Asp Ser Gin Ser 
1655 1660 1665 

Asn Asp Ser Ser Asp Ser Asp Ser Ser Ser Ser Gin Ser Asp Asp 
1670 1675 1680 

He Glu Gin Glu Thr Phe Met Leu Asp Glu Pro Leu Glu Arg Thr 
1685 1690 1695 

Thr Asn Ser Ser His Ala Asn Gly Ala Ala Gin Ala Pro Arg Ser 
1700 1705 1710 

Met Gin Trp Ala Val Arg Asn Thr Gin His Gin Arg Ala Ala Ser 
1715 1720 1725 



Thr Ala Pro Ser Ser Thr Ser Thr Pro Ala Ala Ser Ser Ala Gly 
1730 1735 1740 
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Leu lie Tyr He Asp Pro Ser Asn Leu Arg Arg Ser Gly Thr lie 
1745 1750' 1755 ' 

Ser Thr Ser Ala Ala Ala Ala Ala Ala Ala Leu Glu Ala Ser Asn 
1760 1765 1770 

Ala Ser Ser Tyr Leu Thr Ser Ala Ser Ser Leu Ala Arg Ala Tyr 
1775 1780 1785 

Ser He Val He Arg Gin He Ser Asp Leu Met Gly Leu He Pro 
1790 1795 1800 

. Lys Tyr Asn His Leu Val Tyr Ser Gin He Pro Ala Ala Val Lys 
1805 1810 1815 

Leu Thr Tyr Gin Asp Ala Val Asn Leu Gin Asn Tyr Val Glu Glu 
1820 1825 1830 

Lys Leu He Pro Thr Trp Asn Trp Met Val Ser He Met Asp Ser 
1835 1840 . 1845 

Thr Glu Ala Gin Leu Arg Tyr Gly Ser Ala Leu Ala Ser Ala Gly 
1850 1855 . I860 

Asp Pro Gly His Pro Asn His Pro Leu His Ala Ser Gin Asn Ser 
1865 1870 1875 

. Ala Arg Arg Glu Arg Met Thr Ala Arg Glu Glu Ala Ser Leu Arg 
1880 1885 1890 

Thr Leu Glu Gly Arg Arg Arg Ala Thr Leu Leu Ser Ala Arg Gin 
1895 1900 1905 

Gly Met Met Ser Ala Arg Gly Asp Phe Leu Asn Tyr Ala Leu Ser 
1910 1915 1920 

Leu Met Arg Ser His Asn Asp Glu His Ser Asp Val Leu Pro Val 
1925 1930 1935 

Leu Asp Val Cys Ser Leu Lys His Val Ala Tyr Val Phe Gin Ala 
1940 1945 1950 

Leu He Tyr Trp He Lys Ala Met Asn Gin Gin Thr Thr Leu Asp 
1955 i960 1965 

Thr Pro Gin Leu Glu Arg Lys Arg Thr Arg Glu Leu Leu Glu Leu 
1970 197 5 1980 



Gly He Asp Asn Glu Asp Ser Glu His Glu Asn Asp Asp Asp Thr 
1985 1990 1995 
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Asn Gin Ser Ala Thr Leu Asn . Asp Lys Asp Asp Asp Ser Leu Pro 
2000 2005 2010 

Ala Glu Thr Gly Gin Asn His Pro Phe Phe Arg Arg Ser Asp Ser 
2015 2020 2025 

Met Thr Phe Leu Gly Cys lie Pro Pro Asn Pro Phe Glu Val Pro 
2030 2035 2040 

Leu Ala Glu Ala lie Pro Leu Ala Asp Gin Pro His Leu Leu Gin 
2045 2050 2055 

Pro Asn Ala Arg Lys Glu Asp Leu Phe Gly Arg Pro Ser Gin Gly 
2060 2065 2070 

Leu Tyr Ser Ser Ser Ala Ser Ser Gly Lys Cys Leu Met Glu Val 
2075 2080 2085 

Thr Val Asp Arg Asn Cys Leu Glu Val Leu Pro Thr Lys Met Ser 
2090 2095 2100 

Tyr Ala Ala Asn Leu Lys Asn Val Met Asn Met Gin Asn Arg Gin 
2105 2110 2115 

Lys Lys Glu Gly Glu Glu Gin Pro Val Leu Pro Glu Glu Thr Glu 
2120 2125 2130 

Ser Ser Lys Pro Gly Pro Ser Ala His Asp Leu Ala Ala Gin Leu 
2135 2140 2145 

Lys Ser Ser Leu Leu Ala Glu lie Gly Leu Thr Glu Ser Glu Gly 
2150 2155. 2160 

Pro Pro Leu Thr Ser. Phe Arg Pro Gin Cys Ser Phe Met Gly Met 
2165 2170 2175 

Val He Ser His Asp Met Leu Leu Gly Arg Trp Arg Leu Ser Leu 
2180 2185 2190 

Glu Leu Phe Gly Arg Val Phe Met Glu Asp Val Gly Ala Glu Pro 
2195 2200 2205 

Gly Ser He Leu Thr Glu Leu Gly Gly Phe Glu Val Lys Glu Ser 
2210 2215 2220 

Lys- Phe Arg Arg Glu Met Glu Lys Leu Arg Asn Gin Gin Ser Arg 
2225 2230 2235 



Asp Leu Ser Leu Glu Val Asp Arg Asp Arg Asp Leu Leu He Gin 
2240 2245 2250 
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Gin Thr Met Arg Gin Leu Asn Asn His Phe Gly Arq Ara Cys Ala 
2255 2260 * 2 265 

Thr Thr Pro Met Ala Val His Arg Val Lys Val Thr Phe Lys Asp 
2270 2275 2280 

Glu Pro Gly Glu Gly Ser Gly Val Ala Arg Ser Phe Tyr Thr Ala 
2285 2290 2295 

He Ala Gin Ala Phe Leu Ser Asn Glu Lys Leu Pro Asn Leu Glu 
2300 2305 2310 

Cys He Gin Asn Ala Asn Lys Gly Thr His Thr Ser Leu Met Gin 
2315 2320 2325 

Arg Leu Arg Asn Arg Gly Glu Arg Asp Arg Glu Arg Glu Arg Glu 
2330 2335 2340 

Arg Glu Met Arg Arg Ser Ser Gly Leu Arg Ala Gly Ser- Arg Arg 
2345 2350 2355 

Asp Arg Asp Arg Asp Phe Arg Arg Gin Leu Ser He Asp Thr Arg 
2360 2365 2370 

Pro Phe Arg Pro Ala Ser Glu Gly Asn Pro Ser Asp Asp Pro Glu 
2375 2380 2385 

Pro Leu Pro Ala His Arg Gin Ala Leu Gly Glu Arg Leu Tyr Pro 
2390 2395 - 2400 

Arg Val Gin Ala Met Gin Pro Ala Phe Ala Ser Lys He Thr Glv 
2405 2410 2415 

Met Leu Leu Glu Leu Ser Pro Ala Gin Leu Leu Leu Leu Leu Ala 
2420 2425. 2430 

Ser Glu Asp Ser Leu Arg Ala Arg Val Asp Glu Ala Met Glu Leu 
2435 2440 2445 

He He Ala His Gly Arg Glu Asn Gly Ala Asp Ser He Leu Asp 
2450 2455 2460 

Leu Gly Leu Val Asp Ser Ser Glu Lys Val Gin Gin Glu Asn Arg 
24 65 2470 2475 

Lys Arg His Gly Ser Ser Arg Ser Val Val Asp Met Asp Leu Asp 
2480 2485 2490 



Asp Thr Asp Asp Gly Asp Asp Asn Ala Pro Leu Phe Tyr Gin Pro 
2495 2500 2505 
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Gly Lys Arg Gly Phe Tyr Thr Pro Arg Pro Gly Lys Asn Thr Glu 
2510 2515 2520 

Ala Arg Leu Asn Cys Phe Arg Asn He Gly Arg He Leu Gly Leu 
2525 2530 2535 

Cys Leu Leu Gin Asn Glu Leu Cys Pro He Thr Leu Asn Arg His 
2540 2545 2550 

Val He Lys Val Leu Leu Gly Arg Lys Val Asn Trp His Asp Phe 
2555 2560 2565 

Ala Phe Phe Asp Pro Val Met Tyr Glu Ser Leu Arg Gin Leu He 
2570 2575 2580 

Leu Ala Ser Gin Ser Ser Asp Ala Asp Ala Val Phe Ser Ala Met 
2585 2590 2595 

Asp Leu Ala Phe Ala He Asp Leu Cys Lys Glu Glu Gly Gly Gly 
2600 2605 2610 

Gin Val Glu Leu He Pro Asn Gly Val Asn He Pro Val Thr Pro 
2615 2620 2625 

Gin Asn Val Tyr Glu Tyr Val Arg Lys Tyr Ala Glu His Arg Met 
2630 2635 2640 



Leu Val Val Ala Glu Gin Pro Leu His Ala Met Arg Lys Gly Leu 
2645 2650 2655 



Leu Asp Val Leu Pro Lys Asn Ser Leu Glu Asp Leu Thr Ala Glu 
2660 2665 2670 



Asp Phe Arg Leu Leu Val Asn Gly Cys Gly Glu Val Asn Val Gin 
2675 2680 2685 



Met Leu He Ser Phe Thr Ser ■ Phe Asn Asp Glu Ser Gly Glu Asn 
2690 2695 2700 



Ala Glu Lys Leu Leu Gin Phe Lys Arg Trp Phe Trp Ser He Val 
2705 2710 * 2715 



Glu Lys Met Ser Met Thr Glu Arg Gin Asp Leu Val Tyr Phe Trp 
2720 2725 2730 



Thr Ser Ser Pro Ser Leu Pro Ala Ser Glu Glu Gly Phe Gin Pro 
2735 2740 2745 



Met Pro Ser He Thr He Arg Pro Pro Asp Asp Gin His Leu Pro 
2750 2755 2760 
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Thr Ala Asn Thr Cys He Ser Arg Leu Tyr Val Pro Leu Tyr Ser 
2765 2770 2775 



Ser Lys Gin He Leu Lys Gin Lys Leu Leu Leu Ala He Lys Thr 
2780 2785 2790 



Lys Asn Phe Gly Phe Val 
2795 



<210> 65 

<211> 4256 

<212> . DNA 

<213> NMJU4992 DAAMl 
<220> 

<221> CDS 

<222> (126) . . (3359) 

<223> 



<400> 65 

tacgagcggc tgtgaaggaa actgtttaac cggatcccat tgtaeccaga gtgcagagcc 60 

gcctttccag catgcagggg ctgctcagcg tttagtcaca tcaagaaata gaacagaatt 120 

cagcc atg gcc cca aga aag aga ggt gga cga ggt att tea ttc ate ttt 170 
Met Ala Pro Arg Lys Arg Gly Gly Arg Gly He Ser Phe He Phe 
15 10 15 

tgc tgt ttc cga aat aat gat cac cca gaa ate acg tat egg ctg cga 218 
Cys Cys Phe Arg Asn Asn Asp His Pro Glu He Thr Tyr Arg Leu Arg 
20 25 30 

aat gat age aac ttt gcg ctt cag ace atg gaa cca gca ttg ccc atg 266 
Asn Asp Ser Asn Phe Ala Leu Gin Thr Met Glu Pro Ala Leu Pro Met 
35 40 45 

ccc cct gtg gag gag ctg gat gtc atg ttc agt gaa ctg gtg gat gaa 314 
Pro Pro Val Glu Glu Leu Asp Val Met Phe Ser Glu Leu Val Asp Glu 
50 55 60 

ctg gac etc aca gac aaa cac aga gaa gcc atg ttt gca ctt cca get 362 
Leu Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala 
65 70 75 

gag aaa aaa tgg caa ata tac tgt age aag aaa aag gac cag gaa gaa 410 
Glu Lys Lys Trp Gin lie Tyr Cys Ser Lys Lys Lys Asp Gin Glu Glu 
80 85 90 95 

aac aag gga get aca agt tgg cct gaa ttc tac att gat cag etc aat 458 
Asn Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr He Asp Gin Leu Asn 
100 105 HO 

tec atg get get aga aaa tct ctg ctg get tta gag aag gaa gaa gaa 50 6 

Ser Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu 
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115 120 125 

gaa gaa aga agt aaa act ata gag agt tta aag aca gca ctg agg aca 554 
Glu Glu Arg Ser Lys Thr lie Glu Ser Leu Lys Thr Ala Leu Arg Thr 
130 135 140 

aaa cca atg agg ttt gta acc aga ttc ate gac ttg "gat ggc eta tea 602 
Lys Pro Met Arg Phe Val Thr Arg Phe He Asp Leu Asp Gly Leu Ser 
145 150 155 

tgt ate etc aac ttt eta aag acc atg gac tac gag acc tea gag tct 650 
Cys He Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser 
160 165 170 175 

cga ata cat act tct etc att ggc tgt ata aag gcg tta atg aac aac 698 
Arg He His Thr Ser Leu He Gly Cys He Lys Ala Leu Met Asn Asn 
180 185 190 

tct caa ggc egg get cac gtc ctg get cat tct gag agt att aat gta 746 
Ser Gin Gly Arg Ala His Val Leu Ala His Ser Glu Ser He Asn Val 
195 200 205 

att get cag agt ctg age aca gag aac att aaa acg aag gtg gee gtg 794 
He Ala Gin Ser Leu Ser Thr Glu Asn He Lys. Thr Lys Val Ala Val 
210 '215 * 220 

ctg gaa ate ttg ggc gee gtg tgc ctg gtt ccc ggg ggc cac aag aag 842 
Leu Glu He Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys 
225 230 235 

gtt ctg cag gee atg ctg cac tac cag aag tat gee age gaa agg acc 890 
Val Leu Gin Ala Met Leu His Tyr Gin Lys Tyr Ala Ser Glu Arg Thr 
240 245 t 250 255 

cgc ttt cag aca tta , att aac gac ttg gat aaa age act ggg egg tat 938 
Arg Phe Gin Thr Leu He Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr 
260 265 270 

cga gat gaa gtg agt etc aag act gee ate atg tec ttc att aat gca 986 
Arg Asp Glu Val Ser Leu Lys Thr Ala He Met Ser Phe He Asn Ala 
275 280 285 

gtg etc age caa ggt gca gga gtg gag agt ttg gac ttt aga ctt cat 1034 
Val Leu Ser Gin Gly Ala Gly Val Glu Ser Leu Asp Phe Arg Leu His 
290 295 300 

ctt cgc tat gaa ttt ctg atg tta gga att caa cct gta ata gat aaa 1082 
Leu Arg Tyr Glu Phe Leu Met Leu Gly He Gin Pro Val He Asp Lys 
305 310 315 

tta agg gaa cac gaa aat tea aca tta gat agg cat tta gac ttt ttt 1130 
Leu Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe 
320 325 330 ~ 335 

gaa atg etc cga aat gaa gat gaa eta gaa ttt gee aaa aga ttt gaa 1178 
Glu Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu 
340 345 350 

ctg gtt cac ata gac aca aaa agt gca act cag atg ttt gag ctg acc 1226 
Leu Val His He Asp Thr Lys Ser Ala Thr Gin Met Phe Glu Leu Thr 
355 360 365 

agg aag agg ctg aca cat agt gaa get tac ccg cat ttc atg tec ate 1274 
Arg Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser He 
370 375 380 

ctg cac cac tgc etc caa atg cct tac aag agg agt ggc aac act gtt 1322 
Leu His His Cys Leu Gin Met Pro Tyr Lys Arg Ser Gly Asn Thr Val 
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385 390 395 • 

cag tac tgg eta eta eta gat aga att ata cag cag ata gtt ate cag 1370 
Gin Tyr Trp Leu Leu Leu Asp Arg lie lie Gin Gin He Val He Gin 
400 405 410 415 

aat gac aaa gga cag gac cct gac tec aca cct ttg gaa aac ttt aat 1418 
Asn Asp Lys Gly Gin Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn 
420 425 430 

att aag aat gtc gta cga atg ttg gtt aat gaa aat gaa gtt aag cag 14 66 
He Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gin 
435 440 445 

tgg aaa gaa caa gcg gaa aaa atg aga aaa gag cac aat gag. eta caa 1514 
Trp Lys Glu Gin Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gin 
450 455 .460 

cag aaa ctg gaa aag aaa gaa cga gaa tgt gat get aag act caa gag 1562 
Gin Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gin Glu 
465 470 475 

aag gaa gag atg atg cag ace tta aat aaa atg aaa gag aaa ctt gaa 1610 
Lys Glu Glu Met Met Gin Thr Leu Asn Lys Met Lys Glu Lys Leu Glu 
480 485 490 495 

aag gag act act gag cat aag caa gtc aag cag cag gtg gcg gac etc 1658 
Lys Glu Thr Thr Glu His Lys Gin Val Lys Gin Gin Val Ala Asp Leu 
500 505 510 

aca gca cag etc cat gag etc age agg agg gee gtc tgt get tea ate 1706 
Thr Ala Gin Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser He 
515' 520 525 

cca ggt gga ccc teg cct gga gca cca gga ggg ccc ttt cct tec tct 1754 
Pro Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser 
530 535 540 

gtg cct gga tct etc ctt cct ccc cca cca ccc cca cct eta cca ggt 1802 
Val Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly 
545 550 555 

ggg atg ctt ccc cct cca ccg cct ccc etc cct cca ggt ggc cct cct 1850 
Gly Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro 
560 565 570 575 

cct ccc cca ggg cct cct ccc tta ggg gca ate atg cca cct cct ggt 1898 
Pro Pro Pro Gly Pro Pro Pro Leu Gly Ala He Met Pro Pro Pro Gly 
580 585 590 

get cca atg ggc eta gca ctg aag aag aaa age att cct cag ccc aca 1946 
Ala Pro Met Gly Leu Ala Leu Lys Lys Lys Ser He Pro Gin Pro Thr 
595 600 605 

aat gee ctg aaa tec ttc aac tgg tct aaa ctg ccc gag aac aaa ctg 1994 
Asn Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu 
610 615 620 

gaa gga aca gta tgg ace gaa att gat gat aca aaa gtc ttc aaa att 2042 
Glu Gly Thr Val Trp Thr Glu He Asp Asp Thr Lys Val Phe Lys He 
625 630 635 

eta gat ctt gaa gac ctg gaa aga ace ttc tct gee tat caa aga cag 2090 
Leu Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gin Arg Gin 
640 645 650 655 

cag gat ttc ttt gtg aac agt aac tec aag cag aaa gaa gca gat gee 2138 
Gin Asp Phe Phe Val Asn Ser Asn Ser Lys Gin Lys Glu Ala Asp Ala 
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660 '665 670 

att gat gac act ctg agt tec aaa ctt aaa gtt aaa gag ctt teg gtg 
He Asp Asp Thr Leu Ser Ser Lys Leu Lys Val Lys Glu Leu Ser Val 
675 680 685 

att gat ggt egg aga get cag aat tgc aac ate ctt eta teg agg ttg 
He Asp Gly Arg Arg Ala Gin Asn Cys Asn He Leu Leu Ser Arg Leu 
690 695 700 

aaa tta tec aat gac gaa ate aaa egg gca att eta aca atg gac gaa 
Lys Leu Ser Asn Asp Glu He Lys Arg Ala He Leu Thr Met Asp Glu 
705 710 715 

cag gaa gat ctg ccc aag gac atg ttg gaa cag etc ttg aaa ttt gtt 
Gin Glu Asp Leu Pro Lys Asp Met Leu Glu Gin Leu Leu Lys Phe Val 
720 725 730 735 

cct gaa aaa agt gac att gac eta ttg gag gaa cat aaa cac gaa ctg 
Pro Glu Lys Ser Asp He Asp Leu Leu Glu Glu His Lys His Glu Leu 
740 745 750 

gat egg atg gec aag get gat agg ttc ctt ttt gag atg age cga att 
Asp Arg Met Ala Lys Ala Asp Arg Phe Leu Phe Glu Met Ser Arg He 
755 760 765 

aat cac tat cag caa agg ttg caa teg ctg tac ttc aaa aag aag ttt 
Asn His Tyr Gin Gin Arg Leu Gin Ser Leu Tyr Phe Lys Lys Lys Phe 
770 775 780 

gca gag cgt gtg gca gaa gtg aaa cct aaa gtg gaa gca att cgt tct 
Ala Glu Arg Val Ala Glu Val Lys Pro Lys Val Glu Ala He Arg Ser 
785 790 795 



ggc tea gaa gag gtg ttt agg agt ggt gec etc aag cag ttg ctg gag 
Gly Ser Glu Glu Val Phe Arg Ser Gly Ala Leu Lys Gin Leu Leu Glu 
800 805 810 815 

gtg gtt ttg gca ttt gga aat tat atg aat aaa ggt caa aga ggg aat 
Val Val Leu Ala Phe Gly Asn Tyr Met Asn Lys Gly Gin Arg Gly Asn 
820 825 830 

gca tat gga ttc aag ata tct age eta aac aaa att get gac aca aaa 
Ala Tyr Gly Phe Lys He Ser Ser Leu Asn Lys He Ala Asp Thr Lys 
835 840 845 

tec age ate gac aaa aac att acc ctt ttg cac tat etc ate act att 
Ser Ser He Asp Lys Asn He Thr Leu Leu His Tyr Leu He Thr He 
850 855 860 

gtg gaa aat aag tac ccc agt gtt etc aat eta aat gaa gaa ttg cga 
Val Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu Asn Glu Glu Leu Arg 
865 870 875 

gat att cct caa get gcg aaa gta aac atg act gag ctg gac aaa gaa 
Asp He Pro Gin Ala Ala Lys Val Asn Met Thr Glu Leu Asp Lys Glu 
880 885 890 895 

ata agt acc ttg aga agt ggc ttg aaa gca gta gag aca gag ctg gaa 
He Ser Thr Leu Arg Ser Gly Leu Lys Ala Val Glu Thr Glu Leu Glu 
900 905 910 

tat cag aag tct cag ccc cca cag ccc gga gat aag ttt gtg tct gtt 
Tyr Gin Lys Ser Gin Pro Pro Gin Pro Gly Asp Lys Phe Val Ser Val 
915 920 925 

gtc age cag ttc ate aca gta gec age ttc age ttc tct gat gtt gaa 
Val Ser Gin Phe He Thr Val Ala Ser Phe Ser Phe Ser Asp Val Glu 
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2474 
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930 935 940 

gac ctt eta gca gaa get aaa gac ctg'ttt act aaa gca gtg aag cac 3002 
Asp Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr Lys Ala Val Lys His 
945 950 955 

ttt ggg gaa gag get ggc aaa ata caa cca gat gag ttc ttt ggc att 3050 
Phe Gly Glu Glu Ala Gly Lys He Gin Pro Asp Glu Phe Phe Gly He 
960 965 970 975 

ttt gat caa ttt ctt caa get gtg tea gaa gee aaa caa gaa aac gaa 3098 
Phe Asp Gin Phe Leu Gin Ala Val Ser Glu Ala Lys Gin Glu Asn Glu 
980 985 990 

aat atg aga aag aaa aag gag gaa gaa gaa cgt cga get cgc atg gaa 3146 
Asn Met Arg Lys Lys Lys Glu Glu Glu Glu Arg Arg Ala Arg Met Glu 
995 1000 1005 

get cag etc aaa gaa caa cgt gaa agg gaa cgt aaa .atg aga aaa 3191 
Ala Gin Leu Lys Glu Gin Arg Glu Arg Glu Arg Lys Met Arg Lys 
1010 1015 1020 

get aaa gag aat agt gaa gaa age gga gag ttt gat gac ctt gtt 3236 
Ala Lys Glu Asn Ser Glu Glu Ser Gly Glu Phe Asp Asp Leu Val 
1025 1030 1035 

tea get tta cgc tea gga gaa gtg ttt gac aaa gac ctt tct aaa 3281 
Ser Ala Leu Arg Ser Gly Glu Val Phe Asp Lys Asp Leu Ser Lys 
1040 1045 " " 1050 

ttg aaa egg aat cgc aaa cgt att ace aac cag atg act gac age 3326 
Leu Lys Arg Asn Arg Lys Arg He Thr Asn Gin Met Thr Asp Ser 
1055 1060 1065 

age aga gag aga cca ate aca aaa ctt aat ttc taattttcca 3369 
Ser Arg Glu Arg Pro He Thr Lys Leu Asn Phe 
1070 1075 



tgaatacttt 


tttttagaaa gctcattagc 


agccctctaa 


agtgactaga aegtttcatt 


3429 


acactgcctt 


gcaatccaaa cagtggcaat 


tttttccttc 


atctgtgagt gaatgtgtga 


3489 


acgtgtgtat 


gtaaatgtat gtgtgtatat 


attaaaaaat 


gtatatagat gtctgagtgt 


3549 


tgtctggaga 


ectataegta tggttaaaaa 


gatttatgtt 


aatgtatgtg ctccaaaacc 


3609 


tttcgtgtat 


gcattcacat tgagtgtggc 


tcattttctt 


tccccgaacg ccatgactgt 


3669 


tcagaagcac 


aatactatct cctgaaagag 


ataagagaca 


ttccctagat tcaaaggcaa 


3729 


aacagaagaa 


acaaacaaac aaacaaacaa 


agettgeaaa 


atattttatg gtttccaagc 


3789 


ttgatatcct 


ttaaaattat tttcattgat 


ggaactggag 


ttgttggaaa aacatagatt 


3849 


taaaatgatt 


tttgatagct gacattgtga 


tgttgatgta 


tcacatcagt aataggacca 


3909 


gctttgaatt 


tctgacattg gtgtggggat 


acagtctgta 


aatgtttatt gagaacatct 


3969 


tgcacacaat 


ttgaattatg tagaatgtca 


atcaagtttt 


tgtatattta aaagttggac 


4029 


atcaattttt 


tcccctgatt tcatcaagtt 


atctctgcca 


agtgctcttg ataatttctt 


4089 


cagatttttg 


gaaaaaaaca ctatataaat 


gcaatccatg 


ctttttttaa agaacaacat 


4149 


tgccagagta 


tgcttgttct aacaatatag 


atatataaac 


cttaaaaata ataaaatatc 


4209 


tcacccaaga 


cttaaaggaa gaattctctg 


aagggataaa 


gattact 


4256 
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<210> 66 
<211> 1078 
<212> PRT 

<213> NM_014992 DAAM1 
<400> 66 

Met Ala Pro Arg Lys Arg Gly Gly Arg Gly lie Ser Phe lie Phe Cys 
15 10 15 

Cys Phe Arg Asn Asn Asp His Pro Glu lie Thr Tyr Arg Leu Arg Asn 
20 25 30 

Asp Ser Asn Phe Ala Leu Gin Thr Met Glu Pro Ala Leu Pro Met Pro 
35 40 45 

Pro Val Glu Glu Leu Asp Val Met Phe; Ser Glu Leu Val Asp Glu Leu 
50 55 60 



Asp Leu Thr Asp Lys His Arg Glu Ala Met Phe Ala Leu Pro Ala Glu 
65 70 75 80 



Lys Lys Trp Gin He Tyr Cys Ser Lys Lys Lys Asp Gin Glu Glu Asn 
85 90 95 



Lys Gly Ala Thr Ser Trp Pro Glu Phe Tyr He Asp Gin Leu Asn Ser 
100 105 110 



Met Ala Ala Arg Lys Ser Leu Leu Ala Leu Glu Lys Glu Glu Glu Glu 
115 120 125 



Glu Arg Ser Lys Thr lie Glu Ser Leu Lys Thr Ala Leu Arg Thr Lys 
130 135 140 



Pro Met Arg Phe Val Thr Arg Phe He Asp Leu Asp Gly Leu Ser Cys 
145 150 155 160 



He Leu Asn Phe Leu Lys Thr Met Asp Tyr Glu Thr Ser Glu Ser Arg 
165 170 175 



He His Thr Ser Leu He Gly Cys He Lys Ala Leu Met Asn Asn Ser 
180 185 190 



Gin Gly Arg Ala His Val Leu Ala His Ser Glu Ser He Asn Val He 
195 200 205 



Ala Gin Ser Leu Ser Thr Glu Asn He Lys Thr Lys Val Ala Val Leu 
210 215 220 



WO 2004/022778 



PCT/AU2003/001166 



188 



Glu He Leu Gly Ala Val Cys Leu Val Pro Gly Gly His Lys Lys Val 
225 230- 235 240 



Leu Gin Ala Met Leu His Tyr Gin Lys Tyr Ala. Ser Glu Arg Thr Arg 
245 250 255 

Phe Gin Thr Leu He Asn Asp Leu Asp Lys Ser Thr Gly Arg Tyr Arg 
260 265 270 



Asp Glu Val Ser Leu Lys Thr Ala He Met Ser Phe He Asn Ala Val 
275 280 285 



Leu Ser Gin Gly Ala Gly Val Glu Ser Leu Asp Phe Arg Leu His Leu 
290 295 300 



Arg Tyr Glu Phe Leu Met Leu Gly He Gin Pro Val He Asp Lys Leu 
305 310 315 320 



Arg Glu His Glu Asn Ser Thr Leu Asp Arg His Leu Asp Phe Phe Glu 
325 330 " 335 



Met Leu Arg Asn Glu Asp Glu Leu Glu Phe Ala Lys Arg Phe Glu Leu 
340 345 350 



Val His He Asp Thr Lys Ser Ala Thr Gin Met Phe Glu Leu Thr Arg 
355 .360 365 

Lys Arg Leu Thr His Ser Glu Ala Tyr Pro His Phe Met Ser He Leu 
370 375 380 

His His Cys Leu Gin Met Pro Tyr Lys Arg Ser Gly Asn Thr Val Gin 
385 390 395 400 

Tyr Trp Leu Leu Leu Asp Arg He He Gin Gin He Val He Gin Asn 
405 410 415 



Asp Lys Gly Gin Asp Pro Asp Ser Thr Pro Leu Glu Asn Phe Asn He 
420 425 430 



Lys Asn Val Val Arg Met Leu Val Asn Glu Asn Glu Val Lys Gin Trp 
435 440 445 



Lys Glu Gin Ala Glu Lys Met Arg Lys Glu His Asn Glu Leu Gin Gin 
450 .455 460 



Lys Leu Glu Lys Lys Glu Arg Glu Cys Asp Ala Lys Thr Gin Glu Lys 
465 470 475 480 



Glu Glu Met Met Gin Thr Leu Asn Lys Met Lys Glu Lys Leu Glu Lys 
485 490 495 
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Glu Thr Thr Glu His Lys Gin Val Lys Gin Gin Val Ala Asp Leu Thr 
500 505 510 



Ala Gin Leu His Glu Leu Ser Arg Arg Ala Val Cys Ala Ser lie Pro 
515 520 525 



Gly Gly Pro Ser Pro Gly Ala Pro Gly Gly Pro Phe Pro Ser Ser Val 
530 535 540 ' 



Pro Gly Ser Leu Leu Pro Pro Pro Pro Pro Pro Pro Leu Pro Gly Gly 
545 550 555 560 



Met Leu Pro Pro Pro Pro Pro Pro Leu Pro Pro Gly Gly Pro Pro Pro 
565 570 575 



Pro Pro Gly Pro Pro Pro Leu Gly Ala lie Met Pro Pro Pro Gly Ala 
580 585 590 



Pro Met Gly Leu Ala Leu Lys Lys Lys Ser He Pro Gin Pro Thr Asn 
595 600 605 



Ala Leu Lys Ser Phe Asn Trp Ser Lys Leu Pro Glu Asn Lys Leu Glu 
610 615 620 



Gly Thr Val Trp Thr Glu He Asp Asp Thr Lys Val Phe Lys He Leu 
625 630 635 640 



Asp Leu Glu Asp Leu Glu Arg Thr Phe Ser Ala Tyr Gin Arg Gin Gin 
645 650 ' 655 



Asp Phe Phe Val Asn Ser Asn Ser Lys Gin Lys Glu Ala Asp Ala He 
660 665 670 



Asp Asp Thr Leu Ser Ser Lys Leu Lys Val Lys Glu Leu Ser Val lie 
675 680 685 



Asp Gly Arg Arg Ala Gin Asn Cys Asn He Leu Leu Ser Arg Leu Lys 
690 695 700 



Leu Ser Asn Asp Glu He Lys Arg Ala He Leu Thr Met Asp Glu Gin 
705 710 715 720 



Glu Asp Leu Pro Lys Asp Met Leu Glu Gin Leu Leu Lys Phe Val Pro 
725 730 735 



Glu Lys Ser Asp He Asp Leu Leu Glu Glu His Lys His Glu Leu Asp 
740 745 ' 750 



Arg Met Ala Lys Ala Asp Arg Phe Leu Phe Glu Met Ser Arg He Asn 
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755 760 765 



His Tyr Gin Gin Arg Leu Gin Ser Leu Tyr Phe Lys Lys Lys Phe Ala 
770 775 780 



Glu Arg Val Ala Glu Val Lys Pro Lys Val Glu Ala He Arg Ser Gly 
785 790 795 800 



Ser Glu Glu Val Phe Arg Ser Gly Ala Leu Lys Gin Leu Leu Glu Val 
805 810 815 



Val Leu Ala Phe Gly Asn Tyr Met Asn Lys Gly Gin Arg Gly Asn Ala 
820 825 830 



Tyr Gly Phe Lys He Ser Ser Leu Asn , Lys . He Ala Asp Thr Lys Ser 
835 840 845 



Ser He Asp Lys Asn He Thr Leu Leu His Tyr Leu He Thr He Val 
850 855 860 



Glu Asn Lys Tyr Pro Ser Val Leu Asn Leu Asn Glu Glu Leu Arg Asp 
865 870 875 880 



He Pro Gin Ala Ala Lys Val Asn Met Thr Glu Leu Asp Lys Glu He 
885 890 895 



Ser Thr Leu Arg Ser Gly Leu Lys Ala Vai Glu Thr Glu Leu Glu Tyr 
900 905 910 



Gin Lys Ser Gin Pro Pro Gin Pro Gly Asp Lys Phe Val Ser Val Val 
915 920 925 



Ser Gin Phe He Thr Val Ala Ser Phe Ser Phe Ser Asp Val Glu Asp 
930 935 940 



Leu Leu Ala Glu Ala Lys Asp Leu Phe Thr Lys Ala Val Lys His Phe 
945 950 955 960 



Gly Glu Glu Ala Gly Lys lie Gin Pro Asp Glu Phe Phe Gly He Phe 
965 970 975 



Asp Gin Phe Leu Gin Ala Val Ser Glu Ala Lys Gin Glu Asn Glu Asn 
980 985 990 



Met Arg Lys Lys Lys Glu Glu Glu Glu Arg Arg Ala Arg Met Glu Ala 
995 1000 1005 



Gin Leu Lys Glu Gin Arg Glu Arg Glu Arg Lys Met Arg Lys Ala 
1010 1015 1020 



Lys Glu Asn Ser Glu Glu Ser ' ' Gly Glu Phe Asp Asp Leu Val Ser 



WO 2004/022778 



191 



PCT/AU2003/001166 



1025 1030 1035 



Ala Leu Arg Ser Gly Glu Val Phe Asp Lys Asp Leu Ser Lys Leu 
1040 1045 1050 



Lys Arg Asn Arg Lys Arg He Thr Ash Gin Met Thr Asp Ser Ser 
1055 1060 1065 



Arg Glu Arg Pro He Thr Lys Leu Asn Phe 
1070 1075 



<210> 67 

<211> 1096 

<212> DNA 

<213> NMJJ02122 MHC class II DQ alpha 1/ HLA-DQAl 
<220> 

<221> CDS 

<222> (1)..(765) 

<223> 



<400> 67 

atg ate eta aac aaa get ctg ctg ctg ggg gee etc get ctg ace ace 48 

Met He Leu Asn Lys Ala Leu Leu Leu Gly Ala Leu Ala Leu Thr Thr 

1 5 10 15 

gtg atg age ccc tgt gga ggt gaa gac att gtg get gac cac gtt gee 96 
Val Met Ser Pro Cys Gly Gly Glu Asp He Val Ala Asp His Val Ala 
20 25 30 

tct tgt ggt gta aac ttg tac cag ttt tac ggt ccc tct ggc cag tac 144 
Ser Cys Gly Val Asn Leu Tyr Gin Phe Tyr Gly Pro Ser Gly Gin Tyr 
35 40 45 

ace cat gaa ttt gat gga gat gag cag ttc tac gtg gac ctg gag agg 192 
Thr His Glu Phe Asp Gly Asp Glu Gin Phe Tyr Val Asp Leu Glu Arg 
50 55 60 

aag gag act gee tgg egg tgg cct gag ttc age aaa ttt gga ggt ttt 240 
Lys Glu Thr Ala Trp Arg Trp Pro Glu Phe Ser Lys Phe Gly Gly Phe 
65 70 75 80 

gac ccg cag ggt gca ctg aga aac atg get gtg gca aaa cac aac ttg 288 
Asp Pro Gin Gly Ala Leu Arg Asn Met Ala Val Ala Lys His Asn Leu 
85 90 95 

aac ate atg att aaa cgc tac aac tct ace get get ace aat gag gtt 336 
Asn He Met lie Lys Arg Tyr Asn Ser Thr Ala Ala Thr Asn Glu Val 
100 105 110 

cct gag gtc aca gtg ttt tec aag tct ccc gtg aca ctg ggt cag ccc 384 
Pro Glu Val Thr Val Phe Ser Lys Ser Pro Val Thr Leu Gly Gin Pro 
115 120 125 
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aac acc etc att tgt ctt gtg gac aac ate ttt cct cct.gtg gtc aac 432 
Asn Thr Leu lie Cys Leu Val Asp Asn He Phe Pro Pro Val Val Asn 
130 135 140 

ate aca tgg ctg age aat ggg cag tea gtc aca gaa ggt gtt tct gag 480 
He Thr Trp Leu Ser Asn Gly Gin Ser Val Thr Glu Gly Val Ser Glu 
145 150 155 ~ 160 

acc age ttc etc tec aag agt gat cat tec ttc ttc aag ate agt tac 528 
Thr Ser Phe Leu Ser Lys Ser Asp His Ser Phe Phe Lys He Ser Tyr 
165 170 175 

etc acc ttc etc cct tct get gat gag att tat gac tgc aag gtg gag 576 
Leu Thr Phe Leu Pro Ser Ala Asp Glu He Tyr Asp Cys Lys Val Glu 
180 185 190 

cac tgg ggc ctg gac cag cct ctt ctg aaa cac tgg gag cct gag att 624 
His Trp Gly Leu Asp Gin Pro Leu Leu Lys His Trp Glu Pro Glu He 
195 200 205 

cca gec cct atg tea gag etc aca gag act gtg gtc tgt gee ctg ggg 672 
Pro Ala Pro Met Ser Glu Leu Thr Glu Thr Val Val Cys Ala Leu Gly 
210 215 .220 

ttg tct gtg ggc etc atg ggc att gtg gtg ggc act gtc ttc ate ate 720 
Leu Ser Val Gly Leu Met Gly He Val Val Gly Thr Val Phe He He 
225 230 235 240 

caa ggc ctg cgt tea gtt ggt get tec aga cac caa ggg cca ttg 765 
Gin Gly Leu Arg Ser Val Gly Ala Ser Arg His Gin Gly Pro Leu 
245 250 255 

tgaatcccat cctggaaggg aaggtgcatc gccatctaca ggagcagaag aatggacttg 825 

ctaaatgacc tagcactatt ctctggcccg atttatcata tcccttttct cctccaaata 885 

tttctcctct caccttttct ctgggactta agetgetata tcccctcaga gctcacaaat 945 

gectttacat tctttccctg acctcctgat tttttttttc ttttctcaaa tgttacctac 1005 

aatacatgee tggggtaagc cacccggcta cctaattcct cagtaacctc catctaaaat 1065 

ctccaaggaa gcaataaatt ccttttatga g 1096 

<210> 68 

<211> 255 

<212> PRT 

<213> NM_002122 MHC class II DQ alpha 1/ HLA-DQA1 



<400> 68 

Met He Leu Asn Lys Ala Leu Leu Leu Gly Ala Leu Ala Leu Thr Thr 
1 5 10 15 



Val Met Ser Pro Cys Gly Gly Glu Asp He Val Ala Asp His Val Ala 
20 25 30 



Ser Cys Gly Val Asn Leu Tyr Gin Phe Tyr Gly Pro Ser Gly Gin Tyr 
35 40 45 
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Thr His Glu Phe Asp Gly Asp Glu Gin Phe Tyr Val Asp Leu Glu Arg 
50 55 60 

Lys Glu Thr Ala Trp Arg Trp Pro Glu Phe' Ser Lys Phe Gly Gly Phe 
65 70 75 80 

Asp Pro Gin Gly Ala Leu Arg Asn Met Ala Val Ala Lys His Asn Leu 
85 so 95 

Asn He Met He Lys Arg Tyr Asn Ser Thr Ala Ala Thr Asn Glu Val 
100 105 HO 

Pro Glu Val Thr Val Phe Ser Lys Ser Pro Val Thr Leu Gly Gin Pro 
115 120 125 

Asn Thr Leu He Cys Leu Val Asp Asn He Phe Pro Pro Val Val Asn 
130 135 140 

He Thr Trp Leu Ser Asn Gly Gin Ser Val Thr Glu Gly Val Ser Glu 
145 150 ' 155 160 

Thr Ser Phe Leu Ser Lys Ser Asp His Ser Phe Phe Lys He Ser Tyr 
165 170 175 

Leu Thr Phe Leu Pro Ser Ala Asp Glu He Tyr Asp Cys Lys Val Glu 
180 185 190 

His Trp Gly Leu Asp Gin Pro Leu Leu Lys His Trp Glu Pro Glu He 
195 200 205 

Pro Ala Pro Met Ser Glu Leu Thr Glu Thr Val Val Cys Ala Leu Gly 
21° 215 220 

Leu Ser Val Gly Leu Met Gly He Val Val Gly Thr Val Phe He He 
225 230 235 240 

Gin Gly Leu Arg Ser Val Gly Ala Ser Arg His Gin Gly Pro Leu 
245 250 255 

<210> 69 
<211> 2820 
<212> DNA 

<213> NMJJ03014 SFRP4, secreted frizzled-related protein 4 



<220> 

<221> CDS 
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<222> (238) . . (1275) 
<223> 



<400> 69 

ggcgggttcg cgccccgaag gctgagagct ggcgctgctc gtgccctgtg tgccagacgg 60 

cggagctccg cggccggacc ccgcggcccc gctttgctgc cgactggagt ttgggggaag 120 

aaactctcct gcgccccaga agatttcttc ctcggcgaag ggacagcgaa agatgagggt 180 

ggcaggaaga gaaggcgctt tctgtctgcc ggggtcgcag cgcgagaggg cagtgcc 237 

atg ttc etc tec ate eta gtg gcg ctg tgc ctg tgg ctg cac ctg gcg 285 
Met Phe Leu Ser lie Leu Val Ala Leu Cys Leu Trp Leu His Leu Ala 
15 10 15 

ctg ggc gtg cgc ggc gcg ccc tgc gag gcg gtg cgc ate cct atg tgc 333 
Leu Gly Val Arg Gly Ala Pro Cys Glu Ala Val Arg lie Pro Met Cys 
20 25 30 

egg cac atg ccc tgg aac ate acg egg atg ccc aac cac ctg cac cac 381 
Arg His Met Pro Trp Asn He Thr Arg Met Pro Asn His Leu His His 
35 40 45 

age acg cag gag aac gec ate ctg gee ate gag cag tac gag gag ctg 429 
Ser Thr Gin Glu Asn Ala He Leu Ala He Glu Gin Tyr Glu Glu Leu 
50 '55 60 

gtg gac gtg aac tgc age gec gtg ctg cgc ttc ttc ttc tgt gec atg 477 
Val Asp Val Asn Cys Ser Ala Val Leu Arg Phe Phe Phe Cys Ala Met 
65 70 75 " 80 

tac gcg ccc att tgc acc ctg gag ttc ctg cac gac cct ate aag ccg 525 
Tyr Ala Pro He Cys Thr Leu Glu Phe Leu His Asp Pro He Lys Pro 
85 90 95 

tgc aag teg gtg tgc caa cgc gcg cgc gac gac tgc gag ccc etc atg 573 
Cys Lys Ser Val Cys Gin Arg Ala Arg Asp Asp Cys Glu Pro Leu Met 
100 105 no 



aag atg tac aac cac age tgg ccc gaa age ctg gec tgc gac gag ctg 
Lys Met Tyr Asn His Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu 
115 120 125 



621 



cct gtc tat gac cgt ggc gtg tgc att teg cct gaa gec ate gtc acg 669 
Pro Val Tyr Asp Arg Gly Val Cys He Ser Pro Glu Ala lie Val Thr 
130 135 140 

gac etc ccg gag gat gtt aag tgg ata gac ate aca cca gac atg atg 717 
Asp Leu Pro Glu Asp Val Lys Trp He Asp He Thr Pro Asp Met Met 
i45 150 155 160 

gta cag gaa agg cct ctt gat gtt gac tgt aaa cgc eta age ccc gat 7 65 

Val Gin Glu Arg Pro Leu Asp Val Asp Cys Lys Arg Leu Ser Pro Asp 
165 170 " 175 

egg tgc aag tgt aaa aag gtg aag cca act ttg gca acg tat etc age 813 
Arg Cys Lys Cys Lys Lys Val Lys Pro Thr Leu Ala Thr Tyr Leu Ser 
180 185 190 

aaa aac tac age tat gtt att cat gee aaa ata aaa get gtg cag agg 861 
Lys Asn Tyr Ser Tyr Val He His Ala Lys He Lys Ala Val Gin Arg 
195 200 205 

agt ggc tgc aat gag gtc aca acg gtg gtg gat gta aaa gag ate ttc. 909 
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Ser Gly Cys Asn Glu Val Thr Thr Val Val Asp Val Lys Glu lie Phe 
210 215 220 

aag tec tea tea ccc. ate cct cga act caa gtc ccg etc att aca aat 957 
Lys Ser Ser Ser Pro lie Pro Arg Thr Gin Val Pro Leu lie Thr Asn 
225 230 235 . 240 

tct tct tgc cag tgt cca cac ate ctg ccc cat caa gat gtt etc ate 1005 
Ser Ser Cys Gin Cys Pro His lie Leu Pro His Gin Asp Val Leu lie 
245 250 255 

atg tgt tac gag tgg cgt tea agg atg atg ctt ctt gaa aat tgc tta 1053 
Met Cys Tyr Glu Trp Arg Ser Arg Met Met Leu Leu Glu Asn Cys Leu 
260 265 270 

gtt gaa aaa tgg aga gat cag ctt agt aaa aga tec ata cag tgg gaa 1101 
Val Glu Lys Trp Arg Asp Gin Leu Ser Lys Arg Ser He Gin Trp Glu 
275 280 285 

gag agg ctg cag gaa cag egg aga aca gtt cag gac aag aag aaa aca 1149 
Glu Arg Leu Gin Glu Gin Arg Arg Thr Val Gin Asp Lys Lys Lys Thr 
290 295 300 

gee ggg cgc ace agt cgt agt aat ccc ccc aaa cca aag gga aag cct 1197 
Ala Gly Arg Thr Ser Arg Ser Asn Pro Pro Lys Pro Lys Gly Lys Pro 
305 310 315 "* 320 

cct get ccc aaa cca gee agt ccc aag aag aac att aaa act agg agt 1245 
Pro Ala Pro Lys Pro Ala Ser Pro Lys Lys Asn He Lys Thr Arg Ser 
325 330 335 

gee cag aag aga aca aac ccg aaa aga gtg tgagctaact agtttccaaa 1295 
Ala Gin Lys Arg Thr Asn Pro Lys Arg Val 
340 345 

geggagaett ccgacttcct tacaggatga ggctgggcat tgcctgggac agcctatgta 1355 

aggccatgtg ccccttgccc taacaactca ctgcagtgct cttcatagac acatcttgea 1415 

gcatttttct taaggctatg cttcagtttt tctttgtaag ccatcacaag ccatagtggt 1475 

aggtttgece tttggtacag aaggtgagtt aaagctggtg gaaaaggctt attgeattge 1535 

attcagagta acctgtgtgc atactctaga agagtaggga aaataatget tgttacaatt 1595 

cgacctaata tgtgcattgt aaaataaatg ccatatttca aacaaaacac gtaatttttt 1655 

tacagtatgt tttattacct tttgatatct gttgttgcaa tgttagtgat gttttaaaat 1715 

gtgatgaaaa tataatgttt ttaagaagga acagtagtgg aatgaatgtt aaaagatctt 1775 

tatgtgttta tggtctgcag aaggattttt gtgatgaaag gggatttttt gaaaaattag 1835 

agaagtagca tatggaaaat tataatgtgt ttttttacca atgacttcag tttctgtttt 1895 

tagctagaaa cttaaaaaca aaaataataa taaagaaaaa taaataaaaa ggagaggcag 1955 

acaatgtctg gattcctgtt ttttggttac ctgatttcca tgatcatgat gcttcttgtc 2015 

aacaccctct taagcagcac cagaaacagt gagtttgtct gtaccattag gagttaggta 2075 

ctaattagtt ggctaatget caagtatttt atacccacaa gagaggtatg tcactcatct 2135 

tacttcccag gacatccacc ctgagaataa tttgacaagc ttaaaaatgg ccttcatgtg 2195 

agtgccaaat tttgtttttc ttcatttaaa tattttcttt gectaaatae atgtgagagg 2255 

agttaaatat aaatgtacag agaggaaagt tgagttccac ctctgaaatg agaattactt 2315 
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gacagttggg atactttaat cagaaaaaaa gaacttattt gcagcatttt atcaacaaat 2375 

ttcataattg tggacaattg gaggcattta ttttaaaaaa caattttatt ggccttttgc 2435 

taacacagta agcatgtatt ttataaggca ttcaataaat gcacaacgcc caaaggaaat 2495 

aaaatcctat ctaatcctac tctccactac acagaggtaa tcactattag tattttggca 2555 

tattattctc caggtgtttg cttatgcact tataaaatga tttgaacaaa taaaactagg 2615 

aacctgtata catgtgtttc ataacctgcc tcctttgctt ggccctttat tgagataagt 2675 

tttcctgtca agaaagcaga aaccatctca tttctaacag ctgtgttata ttccatagta 2735 

tgcattactc aacaaactgt tgtgctattg gatacttagg tggtttcttc actgacaata 2795 
ctgaataaac atctcaccgg aattc 



<210> 70 
<211> 346 
<212> PRT 

<213> NM_003014 SFRP4, secreted f rizzled-related protein 4 
<400> 70 

Met Phe Leu Ser He Leu Val Ala Leu Cys Leu Trp Leu His Leu Ala 
15 10 15 

Leu Gly Val Arg Gly Ala Pro Cys Glu Ala Val Arg He Pro Met Cys 
20 25 30 

Arg His Met Pro Trp Asn He Thr Arg Met- Pro Asn His Leu His His 
3 5 40 45 

Ser Thr Gin Glu Asn Ala He Leu Ala He Glu Gin Tyr Glu Glu Leu 
50 55 60 

Val Asp Val Asn Cys Ser Ala Val Leu Arg Phe Phe Phe Cys Ala Met 
65 70 75 - 80 

Tyr Ala Pro He Cys Thr Leu Glu Phe Leu His Asp Pro He Lys Pro 
85 90 95 

Cys Lys Ser Val Cys Gin Arg Ala Arg Asp Asp Cys Glu Pro Leu Met 
!00 105 no 

Lys Met Tyr Asn His Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu 
115 120 125 

Pro Val Tyr Asp Arg Gly Val Cys He Ser Pro Glu Ala He Val Thr 
130 135 140 



2820 
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Asp Leu Pro Glu Asp Val Lys Trp lie Asp lie Thr Pro Asp Met Met 
145 150 155 160 

Val Gin Glu Arg Pro Leu Asp Val Asp Cys Lys Arg Leu Ser Pro Asp 
165 170 175 

Arg Cys Lys Cys Lys Lys Val Lys Pro Thr Leu Ala Thr Tyr Leu Ser 
180 185 190 

\ 

Lys Asn Tyr Ser Tyr Val lie His Ala Lys lie Lys Ala Val Gin Arq 
195 200 205 

Ser Gly Cys Asn Glu Val Thr Thr Val Val Asp Val Lys Glu lie Phe 
210 215 220 

Lys Ser Ser Ser Pro He Pro Arg Thr Gin Val Pro Leu He Thr Asn 
225 2 30 235 240 

Ser Ser Cys Gin Cys Pro His He Leu Pro His Gin Asp Val Leu He 
245 250 255 

Met Cys Tyr Glu Trp Arg Ser Arg Met Met Leu Leu Glu Asn Cys Leu 
260 265 270 

Val Glu Lys Trp Arg Asp Gin Leu Ser Lys Arg Ser He Gin Trp Glu 
275 280 . 285 

Glu Arg Leu Gin Glu Gin Arg Arg Thr Val Gin Asp Lys Lys Lys Thr 
290 295 300 

Ala Gly Arg Thr Ser Arg Ser Asn Pro Pro Lys Pro Lys Gly Lys Pro 
305 310 315 320 

Pro Ala Pro Lys Pro Ala Ser Pro Lys Lys Asn He Lys Thr Arg Ser 
325 330 335 

Ala Gin Lys Arg Thr Asn Pro Lys Arg Val 
340 345 

<210> 71 
<211> 1362 " 
<212> DNA 

<213> NMJ)04039 annexin A2 



<220> 
<221> 
<222> 



CDS 

(50) . . (1066) 
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<223> 



<400> 71 

catttgggga cgctctcagc tctcggcgca cggcccagct tccttcaaa atg tct act 58 

Met Ser Thr 
1 

gtt cac gaa ate ctg tgc aag etc age ttg gag ggt gat cac tct aca 106 
Val His Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser Thr 
5 10 15 

ccc cca agt gca tat ggg tct gtc aaa gec tat act aac ttt gat get 154 
Pro Pro Ser Ala Tyr Gly Ser Val Lys Ala Tyr Thr Asn Phe Asp Ala 
20 25 30 35 

gag egg gat get ttg aac att gaa aca gee ate aag acc aaa ggt gtg • 202 
Glu Arg Asp Ala Leu Asn He Glu Thr Ala He Lys Thr Lys Gly Val 
40 45 50 

gat gag gtc acc att gtc aac att ttg acc aac cgc age aat gca cag 250 
Asp Glu Val Thr He Val Asn He Leu Thr Asn Arg Ser Asn Ala Gin 
55 60 65 

aga cag gat att gee ttc gee tac cag aga agg acc aaa aag gaa ctt 298 
Arg Gin Asp He Ala Phe Ala Tyr Gin Arg Arg Thr Lys Lys Glu Leu 
70 75 80 

gca tea gca ctg aag tea gee tta tct ggc cac ctg gag acg gtg att 346 
Ala Ser Ala Leu Lys Ser Ala Leu Ser Gly His Leu Glu Thr Val He 
85 90 95 

ttg ggc eta ttg aag aca cct get cag tat gac get tct gag eta aaa 394 
Leu Gly Leu Leu Lys Thr Pro Ala Gin Tyr Asp Ala Ser Glu Leu Lys 
100 105 HO 115 

get tec atg aag ggg ctg gga acc gac gag gac tct etc att gag ate 442 
Ala Ser Met Lys Gly Leu Gly Thr Asp Glu Asp Ser Leu He Glu He 
120 125 130 

ate tgc tec aga acc aac cag gag ctg cag gaa att aac aga gtc tac 4 90 

He Cys Ser Arg Thr Asn Gin Glu Leu Gin Glu He Asn Arg Val Tyr 
135 140 145 

aag gaa atg tac aag act gat ctg gag aag gac att att teg gac aca 538 
Lys Glu Met Tyr Lys Thr Asp Leu Glu Lys Asp He lie Ser Asp Thr 
150 155 160 

tct ggt gac ttc cgc aag ctg atg gtt gee ctg gca aag ggt aga aga 586 
Ser Gly Asp Phe Arg Lys Leu Met Val Ala Leu Ala Lys Gly Arg Arg 
165 170 • 175 

gca gag gat ggc tct gtc att gat tat gaa ctg att gac caa gat get 634 
Ala Glu Asp Gly Ser Val He Asp Tyr Glu Leu He Asp Gin Asp Ala 
180 185 190 195 

egg gat etc tat gac get gga gtg aag agg aaa gga act gat gtt ccc 682 
Arg Asp Leu Tyr Asp Ala Gly Val Lys Arg Lys Gly Thr Asp Val Pro 
200 205 " ~ 210 

aag tgg ate age ate atg acc gag egg age gtg ccc cac etc cag aaa . 730 
Lys Trp He Ser He Met Thr Glu Arg Ser Val Pro His Leu Gin Lys 
215 220 225 

gta ttt gat agg tac aag agt tac age cct tat gac atg ttg gaa age 778 
Val Phe Asp Arg Tyr Lys Ser Tyr Ser Pro Tyr Asp Met Leu Glu Ser 
230 235 240 
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ate agg aaa gag gtt aaa gga gac ctg gaa aat get ttc ctg aac ctg 826 
He Arg Lys Glu Val Lys Gly Asp Leu Glu Asn Ala Phe Leu Asn Leu 
245 250 255 

gtt cag tgc att cag aac- aag ccc ctg tat ttt get gat egg ctg tat 874 
Val Gin Cys He Gin Asn Lys Pro Leu Tyr Phe Ala Asp Arg Leu Tyr 
260 265 270 275 

gac tec atg aag ggc aag ggg acg cga gat aag gtc ctg ate aga ate 922 
Asp Ser Met Lys Gly Lys Gly Thr Arg Asp Lys Val Leu He Arg He 
280 285 290 

atg gtc tec cgc agt gaa gtg gac atg.ttg aaa att agg tct gaa ttc 970 
Met Val Ser Arg Ser Glu Val Asp Met Leu Lys He Arg Ser Glu Phe 
295 300 . 305 

aag aga aag tac ggc aag tec ctg tac tat tat ate cag caa gac act 1018 
Lys Arg Lys Tyr Gly Lys Ser Leu Tyr Tyr Tyr He Gin Gin Asp Thr 
310 315 320 

aag ggc gac tac cag aaa gcg ctg ctg tac ctg tgt ggt gga gat gac 1066 
Lys Gly Asp Tyr Gin Lys Ala Leu Leu Tyr Leu Cys Gly Gly Asp Asp 
325 330 335 

tgaagecega cacggcctga gcgtccagaa atggtgctca ccatgcttcc agctaacagg 1126 

tctagaaaac cagcttgega ataacagtcc ccgtggccat ccctgtgagg gtgacgttag 1186 

cattaccccc aacctcattt tagttgecta ageattgect ggccttcctg tctagtctct 124 6 

cctgtaagcc aaagaaatga acattccaag gagttggaag tgaagtctat gatgtgaaac 1306 

actttgcctc ctgtgtactg tgtcataaac agatgaataa actgaatttg tacttt 1362 

<210> 72 

<211> 339 

<212> PRT 

<213> NMJ)04039 annexin A2 



<400> 72 

Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp 
1 5 10 15 

His Ser Thr Pro Pro Ser Ala Tyr Gly Ser Val Lys Ala Tyr Thr Asn 
20 25 30 



Phe Asp Ala Glu Arg Asp Ala Leu Asn He Glu Thr Ala He Lys Thr 
35 40 45 



Lys Gly Val Asp Glu Val Thr He Val Asn He Leu Thr Asn Arg Ser 
50 55 60 



Asn Ala Gin Arg Gin Asp He Ala Phe Ala Tyr Gin Arg Arg Thr Lys 
65 70 75 " 80 
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Lys Glu. Leu Ala Ser Ala Leu Lys Ser Ala Leu Ser Gly His Leu Glu 
85 .90 95 



Thr Val lie Leu Gly Leu Leu Lys Thr Pro Ala Gin Tyr Asp Ala Ser 
100 105 no 



Glu Leu Lys Ala Ser Met Lys Gly Leu Gly Thr Asp Glu Asp Ser Leu 
115 120 125 



lie Glu He He Cys Ser Arg Thr Asn Gin Glu Leu Gin Glu He Asn 
130 135 140 



Arg Val Tyr Lys Glu Met Tyr Lys Thr Asp Leu Glu Lys Asp He He 
145 150 . 155 160 



Ser Asp Thr Ser Gly Asp Phe Arg Lys Leu Met Val Ala Leu Ala Lys 
165 170 175 



Gly Arg Arg Ala Glu Asp Gly Ser Val lie Asp Tyr Glu Leu He Asp 
180 185 190 



Gin Asp Ala Arg Asp Leu Tyr Asp Ala Gly Val Lys Arg Lys Gly Thr 
195 200 205 



Asp Val .Pro Lys Trp He Ser He Met Thr Glu Arg Ser Val Pro His 
210 215 . 220 



Leu Gin Lys Val Phe Asp Arg Tyr Lys Ser Tyr Ser Pro Tyr Asp Met 
225 230 235- 240 



Leu Glu Ser He Arg Lys Glu Val Lys Gly Asp Leu Glu Asn Ala Phe 
245 250 255 



Leu Asn Leu Val Gin Cys He Gin Asn Lys Pro Leu Tyr Phe Ala Asp 
260 265 270 



Arg Leu Tyr Asp Ser Met Lys Gly Lys Gly Thr Arg Asp Lys Val Leu 
275 280 285 



He Arg He Met Val Ser Arg Ser Glu Val Asp Met Leu Lys He Arg 
290 295 300 



Ser Glu Phe Lys Arg Lys Tyr Gly Lys Ser Leu Tyr Tyr Tyr He Gin 
305 310 315 320 



Gin Asp Thr Lys Gly Asp Tyr Gin Lys Ala Leu Leu Tyr Leu Cys Gly 
325 330 335 



Gly Asp Asp 



WO 2004/022778 



201 



PCT/AU2003/001166 



<210> 73 

<211> 850 

<212> DNA 

<213> NM_003955 SOCS3 
<220> 

<221> CDS 

<222> (107) . . (781) 

<223> 



<400> 73 

gcgccttcct ctccgcagcc ccccgggatg cggtagcggc cgctgtgcgg aggccgcgaa 60 

gcagctgcag ccgccgccgc gcagatccac gctggctccg tgcgcc atg gtc acc 115 

Met Val Thr 
1 

cac age aag ttt ccc gec gec ggg atg age cgc ccc ctg gac acc age 163 
His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu Asp Thr Ser 
5 10 .15 

ctg cgc etc aag acc ttc age tec aag age gag tac cag ctg gtg gtg 211 
Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser. Glu Tyr Gin Leu Val Val 
20 25 30 35 

aac gca gtg cgc aag ctg cag gag age ggc ttc tac tgg age gca gtg 259 
Asn Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp Ser Ala Val 
40 45 50 

acc ggc ggc gag gcg aac ctg ctg etc agt gee gag ccc gee ggc acc 307 
Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro Ala Gly Thr 
55 60 65 

ttt ctg ate cgc gac age teg gac cag cgc cac ttc ttc acg etc age 355 
Phe Leu lie Arg Asp Ser Ser Asp Gin Arg His Phe Phe Thr Leu Ser 
70 75 80- 



gtc aag acc cag tct ggg acc aag aac ctg cgc ate cag tgt gag ggg 
Val Lys Thr Gin Ser Gly Thr Lys Asn Leu Arg He Gin Cys Glu Gly 
85 90 95 



403 



ggc age ttc tct ctg cag age gat ccc egg age aCg cag ccc gtg ccc 451 
Gly Ser Phe Ser Leu Gin Ser Asp Pro Arg Ser Thr Gin Pro Val Pro 
100 105 110 115 

cgc ttc gac tgc gtg etc aag ctg gtg tac cac tac atg ccg ccc cct 499 
Arg Phe Asp Cys Val Leu Lys Leu Val Tyr His Tyr Met Pro Pro Pro 
120 125 130 

gga gec ccc tec ttc ccc teg cca cct act gaa ccc tec tec gag gtg 547 
Gly Ala Pro Ser Phe Pro Ser Pro Pro Thr Glu Pro Ser Ser Glu Val 
135 140 145 

ccc gag cag ccg tct gec cag cca etc cct ggg agt ccc ccc aga aga 595 
Pro Glu Gin Pro Ser Ala Gin Pro Leu Pro Gly Ser Pro Pro Arg Arg 
150 155 160 

gec tat tac ate tac tec ggg ggc gag aag ate ccc ctg gtg ttg age 643 
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165 170 175 



180 185 190 



200 205 

ggg ccc att egg gag ttc ctg gac cag tac gat gec ccg ctt 781 
Gly Pro lie Arg Glu Phe Leu Asp Gin Tyr Asp Ala Pro Leu 

215 220 225 . 

taaggggtaa agggegcaaa gggcatgggt egggagaggg gacgcaggcc cctctcctcc 841 

gtggcacat 850 



Leu 


Ser 




egg 


aag 


691 


Arg 


Lys 






195 




ctg 


ccg 


739 


Leu 


Pro 




210 







<210> 


74 


<211> 


•225 


<212> 


PRT 


<213> 


NM 003955 SOCS3 



<400> 74 . 

Met Val Thr His Ser Lys Phe Pro Ala Ala Gly Met Ser Arg Pro Leu 
15 10 15 



Asp Thr Ser Leu Arg Leu Lys Thr Phe Ser Ser Lys Ser Glu Tyr Gin 
20 25 30 



Leu Val Val Asn Ala Val Arg Lys Leu Gin Glu Ser Gly Phe Tyr Trp 
35 40 45 



Ser Ala Val Thr Gly Gly Glu Ala Asn Leu Leu Leu Ser Ala Glu Pro 
50 55 60 



Ala Gly Thr Phe Leu lie Arg Asp Ser Ser Asp Gin Arg His Phe Phe 
65 70 75 80 



Thr Leu Ser Val Lys Thr Gin Ser Gly Thr Lys Asn Leu Arg lie Gin 
85 90 95 



Cys Glu Gly Gly Ser Phe Ser Leu Gin Ser Asp Pro Arg Ser Thr Gin 
100 105 110 



Pro Val Pro Arg Phe Asp Cys Val Leu Lys Leu Val Tyr His Tyr Met 
115 120 125 



Pro Pro Pro Gly Ala Pro Ser Phe Pro Ser Pro Pro Thr Glu Pro Ser 
130 135 140 
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Ser Glu Val Pro Glu Gin Pro Ser Ala Gin Pro Leu Pro Gly Ser Pro 
"5 150 . 155 " i 60 



Pro Arg Arg Ala Tyr Tyr He Tyr Ser Gly Gly Glu Lys He Pro Leu 
165 170 175 



Val Leu Ser Arg Pro Leu Ser Ser Asn Val Ala Thr Leu Gin His Leu 
180 185 190 



Cys Arg Lys Thr Val Asn . Gly His Leu Asp Ser Tyr Glu Lys Val Thr 
195 200 205 



Gin Leu Pro Gly Pro He Arg Glu Phe Leu Asp Gin Tyr Asp Ala Pro 
210 215 220 



Leu 
225 



<210> 75 

<211> 369 

<212> DNA 

<213> NM_000331 SAA1, serum amyloid Al 



<220> 

<221> CDS 

<222> (1)..{366) 
<223> 



<400> 75 

atg aag ctt etc acg ggc ctg gtt ttc tgc tec ttg gtc ctg ggt gtc 48 
Met Lys Leu Leu Thr Gly Leu Val Phe Cys Ser Leu Val Leu Gly Val 
1 5 10 15 

age age cga age ttc ttt teg ttc ctt ggc gag get ttt gat ggg get 96 
Ser Ser Arg Ser Phe Phe Ser Phe Leu Gly Glu Ala Phe Asp Gly Ala 
20 25 30 

egg gac atg tgg aga gee tac tct gac .atg aga gaa gee aat tac ate 144 
Arg Asp Met Trp Arg Ala Tyr Ser Asp Met Arg Glu Ala Asn Tyr He 
35 40 45 

ggc tea gac aaa tac ttc cat get egg ggg aac tat gat get gee aaa 192 
Gly Ser Asp Lys Tyr Phe His Ala Arg Gly Asn Tyr Asp Ala Ala Lys 
50 55 60 

agg gga cct ggg ggt gtc tgg get gca gaa gcg ate age gat gee aga 240 
Arg Gly Pro Gly Gly Val Trp Ala Ala Glu Ala He Ser Asp Ala Arg 
65 70 75 80 

gag aat ate cag aga ttc ttt ggc cat ggt gcg gag gac teg ctg get 288 
Glu Asn He Gin Arg Phe Phe Gly His Gly Ala Glu Asp Ser Leu Ala 
85 90 95 
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gat cag get gec aat gaa tgg ggc agg agt ggc aaa gac ccc aat cac 336 
Asp Gin Ala Ala Asn Glu Trp Gly Arg Ser Gly Lys Asp Pro Asn His 
100 105 HO 

ttc cga cct get ggc ctg cct gag aaa tac tga 369 
Leu Pro Glu Lys Tyr 
120 





115 


<210> 


76 • 


<211> 


122 


<212> 


PRT 


<213> 


NM_000331 


<400> 


76 



Met Lys Leu Leu Thr Gly Leu Val Phe Cys Ser Leu Val Leu Gly Val 
1 5 10 15 

Ser Ser Arg Ser Phe Phe Ser Phe Leu Gly Glu Ala Phe Asp Gly Ala 
20 25 30 

Arg Asp Met Trp Arg Ala Tyr Ser Asp Met Arg Glu Ala Asn Tyr lie 
35 40 45 

Gly Ser Asp Lys Tyr Phe His Ala Arg Gly Asn Tyr Asp Ala Ala Lys 
50 55 60 • 

Arg Gly Pro Gly Gly Val Trp Ala Ala Glu Ala lie Ser Asp Ala Arg 
65 70 75 80 

Glu Asn lie Gin Arg Phe Phe Gly His Gly Ala Glu Asp Ser Leu Ala 
85 90 95 

Asp Gin Ala Ala Asn Glu Trp Gly Arg Ser Gly Lys Asp Pro Asn His 
100 a05 no 

Phe Arg Pro Ala Gly Leu Pro Glu Lys Tyr 
115 120 



<210> 77 

<211> 895 

<212> DNA 

<213> NM 014059 RGC32 



<220> 

<221> CDS 
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<222> (147) . . (497) 
<223> 



<400> 77 

gcggccgcgt cgaccggcgc ggctggagcg cagcgccgaa gggactggca gggctgaagt 60 

gtgcgggaca gcaagccccc gaatagcccc ggctgccacc tcgcaggacc caaggccacg 120 

cgcgccgggc ccagctgagc cgcctc atg aag ccg ccc gcg gag gac ctg teg 173 

Met Lys Pro Pro Ala Glu Asp Leu Ser 
1 5 

gac gcg ctg tgc gag ttt gac gcg gtg ctg gec gac ttc gcg teg ccc 221 
Asp Ala Leu Cys Glu Phe Asp Ala Val Leu Ala Asp Phe Ala Ser Pro 
10 15 20 ~ 25 

ttc cac gag cgc cac ttc cac tac gag gag cac ctg gag cgc atg aag 269 
Phe His Glu Arg His Phe His Tyr Glu Glu His Leu Glu Arg Met Lys 
30 35 40 

egg cgc age age gee agt gtc age gac age age ggc ttc age gac teg 317 
Arg Arg Ser Ser Ala Ser Val Ser Asp Ser Ser Gly Phe Ser Asp Ser 
45 50 55 

gag agt gca gat tea ctt tat agg aac age ttc age ttc agt gat gaa 365 
Glu Ser Ala Asp Ser Leu Tyr Arg Asn Ser Phe Ser Phe Ser Asp Glu 
60 65 70 

aaa ctg aat tct cca aca gac tct acc cca get ctt etc tct gee act 413 
Lys Leu Asn Ser Pro Thr Asp Ser Thr Pro Ala Leu Leu Ser Ala Thr- 
75 80 85 

gtc act cct cag aaa get aaa tta gga gac aca aaa gag eta gaa gee 461 
Val Thr Pro Gin Lys Ala Lys Leu Gly Asp Thr Lys Glu Leu Glu Ala 
90 95 100 105 

ttc att get gat ctt gac aaa act tta gca agt atg tgaaacaaga 507 
Phe He Ala Asp Leu Asp Lys Thr Leu Ala Ser Met 
U0 . 115 

agttctgggt cctttcatca taagggagaa gcttcagaaa gttccgagga ectgetaaaa 567 

tcagctacta gaatctgctg ccagagggga caaagaegtg cactcaacct tctaccaggc 627 

cactctcagg ctcaccttaa aatcagccct tgatcccatt tctgggcaat ttagacagtg 687 

aaactgactt tgtttacctg ettgeagcat attagaacag acgatccatg ctaatattgt 747 

attttctctt aaaacatagc tttcctgtaa tttaaagtgc ttttatgaaa atatttgtaa 807 

ttaattatat atagttggaa atagcagtaa gctttcccat tataatatat ttttgtatac 867 

aaataaaatt tgaactgaac ctcgtgcc 895 

<210> 78 

<211> 117 

<212> PRT 

<213> NM 014059 RGC32 
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<400>. 78 



Met Lys Pro Pro Ala Glu Asp Leu Ser Asp Ala Leu Cys Glu Phe Asp 
1 5 10 15 



Ala Val Leu Ala Asp Phe Ala Ser Pro Phe His Glu Arg His Phe His 
20 25 30 



Tyr Glu Glu His Leu Glu Arg Met Lys Arg Arg Ser Ser Ala Ser Val 
35 40 45 



Ser Asp Ser Ser Gly Phe Ser Asp Ser Glu Ser Ala Asp Ser Leu Tyr 
50 55 60 



Arg Asn Ser Phe Ser Phe Ser Asp Glu Lys Leu Asn Ser Pro Thr Asp 
65 70 75 80 



Ser Thr Pro Ala Leu Leu Ser Ala Thr Val Thr Pro Gin Lys Ala Lys 
85 90 95 



Leu Gly .Asp Thr Lys Glu Leu Glu Ala Phe He Ala Asp Leu Asp Lys 
100 105 110 



Thr Leu Ala Ser Met 
115 



<210> 79 

<211> 1564 

<212> DNA 

<213> NM 018004 FLJ10134 



<220> 

<221> CDS 

<222> (314) . . (1138) 

<223> 

<400> 79 

gaagcacatc tggacagctg tgcggcctcc ttgcgggccg acgtcagccg agcacgtccc 60 

ccacgtcctc tccttctcgc cacttattat ttattcgttt tcccaaagaa gcgactaggg 120 

acccaagttt aaaaattcct ccccccactc aatgcgagac gtggccagat cccatccaac 180 

acacggttta attttcatgg ggctctggga tcaaaagaac agaaacagca acaacaaaag 240 

cccagccgct gtctgatttt aagctggcaa agtgggaaaa ataaagtgtt gagtaaacag 300 

accaagttgg ate atg ggg aat ttc aga ggt cat gec etc cct gga acc 349 
Met Gly Asn Phe Arg Gly His Ala Leu Pro Gly Thr 
1 5 .10 
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ttc ttt ttt att att ggt ctt tgg tgg tgt aca aag agt att ctg aag ' 397 
Phe Phe Phe lie lie Gly Leu Trp Trp Cys Thr Lys Ser lie Leu lJs 
15 .20 25 

tat ate tgc aaa aag caa aag cga acc tgc tat ctt ggt tec aaa aca 445 
Tyr lie Cys Lys Lys Gin Lys Arg Thr Cys Tyr Leu Gly Ser Lys Thr 
30 35 40 

^ ta L tC 5 at ° ga ttg gaa att ttg Sag.^ga att aca ata gtt ggc atg 493 
Leu Phe Tyr Arg Leu Glu He Leu Glu Gly He Thr He Val Gly Met 
45 50 



55 60 



95 100 



105 



541 



get tta act ggc atg get ggg gag cag ttt att cct gga ggg ccc cat 
Ala Leu Thr Gly Met Ala Gly Glu Gin Phe He Pro Gly Gly Pro His 
65 70 75 

ctg atg tta tat gac tat aaa caa ggt cac tgg aat caa etc ctg ggc 589 
Leu Met Leu Tyr Asp Tyr Lys Gin Gly His Trp Asn Gin Leu Leu Gly 
80 85 90 

tgg cat cat ttc acc atg tat ttc ttc ttt ggg ctg ttg ggt gtg gca 
Trp His His Phe Thr Met Tyr Phe Phe Phe Gly Leu Leu Gly Val Ala 



637 



lit t r q 11° ^ C 3tC dgt tCa Ctt CCt gtg tcc tta acc **9 685 

Asp He Leu Cys Phe Thr He Ser Ser Leu Pro Val Ser Leu Thr Lys 

110 H5 120 

Lon wpf ^ g ?° ^ tfct gtg gag gCC fctt atC ttc tac aa C 733 

Leu Met Leu Ser Asn Ala Leu Phe Val Glu Ala Phe He Phe Tyr Asn 

125 130 135 140 

u a ° g ?° Cgg gaa atg Ctg gac atc fctt gt 9 cac cag ctg ctg 781 

His Thr His Gly Arg Glu Met Leu Asp He Phe Val His Gin Leu Leu 
145 150 155 

Val lit I? r t<3 a u a gg ° CtC gtt gcC ttc cta gag ttc ctt 829 

Val Leu Val Val Phe Leu Thr Gly Leu Val Ala Phe Leu Glu Phe Leu 

160 165 170 

aa ° 3at 9ta Ctt Ctg gag cta ttg cgg tca a ^ c tc att ctg 877 
Val Arg Asn Asn Val Leu Leu Glu Leu Leu Arg Ser Ser Leu He Leu 
175 180 185 

111 r? 9 r? g t9g ttC ttt ° ag att gga ttfc gtc ctg tat ccc ccc 

Leu Gin Gly Ser Trp Phe Phe Gin He Gly Phe Val Leu Tyr Pro Pro 

190 195 200 

agt gga ggt cct gca tgg gat ctg atg gat cat gaa aat att ttg ttt 
Ser Gly Gly Pro Ala Trp Asp Leu Met Asp His Glu Asn He Leu Phe 
205 . 215 220 

etc acc ata tgc ttt tgt. tgg cat tat gca gta acc att gtc atc gtt 1021 
Leu Thr He Cys Phe Cys Trp His Tyr Ala Val Thr He Val He Val 
225 230 235 

r?f, l at g f t ttC att acc tgg ttg gtt aaa tct a * a c tt aag 1069 

Gly Met Asn Tyr Ala Phe He Thr Trp Leu Val Lys Ser Arg Leu Lys 

24 ° 245 ~ 250 

agg etc tgc tcc tca gaa gtt gga ctt ctg aaa aat get gaa cga gaa 1117 
Arg Leu Cys Ser Ser Glu Val Gly Leu Leu Lys Asn Ala Glu Arg Glu 
255 260 265 

caa gaa tca gaa gaa gaa atg tgactttgat gagcttccag tttttctaga 1168 
Gin Glu Ser Glu Glu Glu Met 
270 275 



925 



97.3 
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taaacctttt cttttttaca ttgttcttgg ttttgtttct cgatcttttg tttggagaac 1228 

agctggctaa ggatgactct aagtgtactg tttgcatttc caatttggtt aaagtatttg 1288 

aatttaaata ttttcttttt agctttgaaa atattttggg tgatactttc attttgcaca 1348 

tcatgcacat catggtattc aggggctaga gtgatttttt tccagattat ctaaagttgg 1408 

atgcccacac tatgaaagaa atatttgttt tatttgcctt atagatatgc tcaaggttac 1468 

tgggcttgct actatttgta actccttgac catggaatta tacttgttta tcttgttgct 1528 

. gcaatgagaa ataaatgaat gtatgtattt tggtgc 1564 

<210> 80 

<211> 275 

<212> PRT 

<213> NM_018004 FLJ10134 

<400> 80 

Met Gly Asn Phe Arg Gly His Ala Leu Pro Gly Thr Phe Phe Phe He 
1 5 10 15 

He Gly Leu Trp Trp Cys Thr Lys Ser He Leu Lys Tyr He Cys Lys 
20 25 30 

Lys Gin Lys Arg Thr Cys Tyr Leu Gly Ser Lys Thr Leu Phe Tyr Arg 
35 40 45 

Leu Glu He Leu Glu Gly He Thr He Val Gly Met Ala Leu Thr Glv 
50 55 so 

Met Ala Gly Glu Gin Phe He Pro Gly Gly Pro His Leu Met Leu Tyr 
65 70 75 80 

Asp Tyr Lys Gin Gly His Trp Asn Gin Leu Leu Gly Trp His His Phe 
85 90 95 

Thr Met Tyr Phe Phe Phe Gly Leu Leu Gly Val Ala Asp He Leu Cys 
100 105 . no 

Phe Thr He Ser Ser Leu Pro Val Ser Leu Thr Lys Leu Met Leu Ser 
U5 120 125 

Asn Ala Leu Phe Val Glu Ala Phe He Phe Tyr Asn His Thr His Gly 
13 ° 135 140 

Arg Glu Met Leu Asp He Phe Val His Gin Leu Leu Val Leu Val Val 
145 150 155 160 
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Phe Leu Thr Gly Leu Val Ala Phe Leu Glu Phe Leu Val Arg Asn Asn 
16 5 170 175 

Val Leu Leu Glu Leu Leu Arg Ser Ser Leu lie Leu Leu Gin Gly Ser 
180 185 190 

Trp Phe Phe Gin lie Gly Phe Val Leu Tyr Pro Pro Ser Gly Gly Pro 
195 200 205 

Ala Trp Asp Leu Met Asp His Glu Asn He- Leu Phe Leu Thr He Cys 
210 215 220 

-Phe Cys Trp His Tyr Ala Val -Thr lie Val "lie Val Gly Met Asn Tyr 
225 230 235 240 

Ala Phe He Thr Trp Leu Val Lys Ser Arg Leu Lys Arg Leu Cys Ser 
245 250 255 

Ser Glu Val Gly Leu Leu Lys Asn Ala Glu Arg Glu Gin Glu Ser Glu 
2 60 265 270 

Glu Glu Met 

275 



<210> 81 

<211> 2311 

<212> DNA 

<213> NMJ)04004 GJB2, connexin 26 



<220> 

<221> CDS 

<222> (199) . . (876) 

<223> 



<400> 81 

gatttaatcc tatgacaaac taagttggtt ctgtcttcac ctgttttggt gaggttgtgt 60 

aagagttggt gtttgctcag gaagagattt aagcatgctt gcttacccag actcagagaa 120 

gtctccctgt tctgtcctag ctatgttcct gtgttgtgtg cattcgtctt ttccagagca 180 

aaccgcccag agtagaag atg gat tgg ggc acg ctg cag acg ate ctg ggg 231 
Met Asp Trp Gly Thr Leu Gin Thr He Leu Glv 
1 5 10 

ggt gtg aac aaa cac tec acc age att gga aag ate tgg etc ace gtc 279 
Gly Val Asn Lys His Ser Thr Ser He Gly Lys He Trp Leu Thr Val 
15 20 25 

etc ttc att ttt cgc att atg ate etc gtt gtg get gca aag gag gtg 327 
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Leu Phe He Phe Arg He Met He Leu Val Val Ala. Ala Lys Glu Val 
30 35 40 

tgg gga gat gag cag gcc gac ttt gtc tgc aac acc ctg cag cca ggc 375 
Trp Gly Asp Glu Gin Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly 
45 50 55 

tgc aag aac gtg tgc tac gat cac tac ttc ccc ate tec cac ate egg 423 
Cys Lys Asn Val Cys Tyr Asp His Tyr Phe Pro He Ser His He Arg 
60 65 .70 75 

. eta tgg gcc ctg cag ctg ate ttc gtg tec age cca gcg etc eta gtg 471 
Leu Trp Ala Leu Gin Leu He Phe Val .Ser Ser Pro Ala Leu Leu Val 
80 85 90 

gcc atg cac gtg gcc tac egg aga cat gag aag aag agg aag ttc ate 519 
Ala Met His Val Ala Tyr Arg, Arg His Glu Lys Lys Arg Lys Phe lie 
95 100 105 

aag ggg gag ata aag agt gaa ttt aag gac ate gag gag ate aaa acc 567 
Lys Gly Glu He Lys Ser Glu Phe Lys Asp He Glu Glu lie Lys Thr 
110 115 120 

cag aag gtc cgc ate gaa ggc tec ctg tgg tgg acc tac aca age age 615 
Gin Lys Val Arg He Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser 
125 • 130 135 

^° f gg gt ° atC ttc gaa gcc gcc ttc at 9 tac 9^ ttc tat 663 
He Phe Phe Arg Val He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr 

140 145 150 155 

2*? S*? ! aC ga ° ggC ttC tcc atg cag c 99 ct 9 9t9 a ag tgc aac gcc 711 
Val Met Tyr Asp Gly Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala 

160 165 170 

tgg cct tgt ccc aac act gtg gac tgc ttt gtg tcc egg ccc acg gag 759 
Trp Pro Cys Pro Asn Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu 
175 180 i 8 5 

ill ?f S? C *u a 9tg ttC atg att gca gtg tct W att tgc ate 807 
Lys Thr Val Phe Thr Val Phe Met He Ala Val Ser Gly He Cys lie 

190 195 200 

ctg ctg aat gtc act gaa ttg tgt tat ttg eta att aga tat tgt tct 855 
Leu Leu Asn Val Thr Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser 
205 210 215 

ggg aag tea aaa aag cca gtt taacgcattg cccagttgtt agattaagaa 906 
Gly Lys Ser Lys Lys Pro Val 
220 225 

atagacagca tgagagggat gaggcaaccc gtgetcaget gtcaaggctc agtcgccagc 966 

atttcccaac acaaagattc tgaccttaaa tgcaaccatt tgaaacccct gtaggectea 1026 

ggtgaaactc cagatgccac aatgagctct gctcccctaa agectcaaaa caaaggecta 1086 

attctatgee tgtcttaatt ttctttcact taagttagtt ccactgagac cccaggctgt 1146 

taggggttat tggtgtaagg tactttcata ttttaaacag aggatategg catttgtttc 1206 

tttctctgag gacaagagaa aaaagecagg ttccacagag gacacagaga aggtttgggt 1266 

gtcctcctgg ggttcttttt gccaactttc cccacgttaa aggtgaacat tggttctttc 1326 

atttgetttg gaagttttaa tctctaacag tggacaaagt taccagtgcc ttaaactctg 1386 

ttacactttt tggaagtgaa aactttgtag tatgataggt tattttgatg taaagatgtt 1446 
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ctggatacca ttatatgttc cccctgtttc agaggctcag attgtaatat gtaaatggta 1506 

tgtcattcgc tactatgatt taatttgaaa tatggtcttt tggttatgaa tactttgcag 1566 

cacagctgag agaggctgtc tgttgtattc attgtggtca tagcacctaa caacattgta 1626 

gcctcaatcg agtgagacag actagaagtt cctagttggc ttatgatagc aaatggcctc 1686 

atgtcaaata ttagatgtaa ttttgtgtaa gaaatacaga ctggatgtac caccaactac 1746 

tacctgtaat gacaggcctg tccaacacat ctcccttttc catgctgtgg tagccagcat 1806 

cggaaagaac gctgatttaa agaggtgagc ttgggaattt tattgacaca gtaccattta 1866 

atggggagac aaaaatgggg gccaggggag ggagaagttt ctgtcgttaa aaacgagttt 1926 

ggaaagactg gactctaaat tctgttgatt aaagatgagc tttgtctacc ttcaaaagtt 1986 

tgtttggctt acccccttca gcctccaatt ttttaagtga aaatataact aataacatgt 2046 

gaaaagaata gaagctaagg tttagataaa tattgagcag atctatagga agattgaacc . 2106 

tgaatattgc cattatgctt gacatggttt ccaaaaaatg gtactccaca tacttcagtg 2166 

agggtaagta ttttcctgtt gtcaagaata gcattgtaaa agcattttgt aataataaag 2226 

aatagcttta atgatatgct tgtaactaaa ataattttgt aatgtatcaa atacatttaa 2286 

aacattaaaa tataatctct ataat 2311 

<210> 82 

<211> 226 

<212> PRT 

<213> NM_004004 GJB2, connexin 26 

<400> 82 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
1 5 10 15 

Ser Thr Ser He Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 30 

He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 

Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 
50 55 60 

Tyr Asp His Tyr Phe Pro He Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 80 



Leu He Phe Val Ser Ser Pro Ala Leu Leu Val Ala Met His Val Ala 
85 90 95 
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Tyr Arg Arg His Glu Lys Lys Arg Lys Phe He Lys Gly Glu He Lys 
100 * 105 HQ 

Ser Glu Phe Lys Asp He Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 



Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 135 



140 



He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 



150 



155 



160 



Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn 
165 ivo 175 



Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 
I 80 185 " 190 



Val Phe Met He Ala Val Ser Gly He Cys He Leu Leu Asn Val Thr 
i95 200 205 



Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser Gly Lys Ser Lys Lys 
210 • 215 220 



Pro Val 
225 



<210> 83 

<211> 2389 

<212> DNA 

<213> NMJ)02514 NOV1, nephroblastoma overexpressed gene 



<220> 

<221> CDS 

<222> (73) .. (1143) 

<223> 



<400> 83 

gggaaggcga gcagtgccaa tctacagcga agaaagtctc gtttggtaaa agcgagaggg 60 

gaaagcctga gc atg cag agt gtg cag age acg age ttt tgt etc cga aag ill 
Met Gin Ser Val Gin Ser Thr Ser Phe Cys Leu Arg Lys 
1 5 10 

cag tgc ctt tgc ctg acc ttc ctg ctt etc cat etc ctg gga cag gtc 159 
Gin Cys Leu Cys Leu Thr Phe Leu Leu Leu His Leu Leu Gly Gin Val 
15 20 25 

get gcg act cag cgc tgc cct ccc cag tgc ccg ggc egg tgc cct gcg 207 
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Ala Ala Thr Gin Arg Cys Pro Pro Gin Cys Pro Gly Arg Cys Pro Ala 
30 35 40 45 

acg ccg ccg acc tgc gcc ccc ggg gtg cgc gcg gtg ctg gac ggc tgc 255 
Thr Pro Pro Thr Cys Ala Pro Gly Val Arg Ala Val Leu Asp Gly Cys 
50 55 60 

tea tgc tgt ctg gtg tgt gcc cgc cag cgt ggc gag age tgc tea gat 303 
Ser Cys Cys Leu Val Cys Ala Arg Gin Arg Gly Glu Ser Cys Ser Asp 
65 70 75 

ctg gag cca tgc gac gag age agt ggc etc tac tgt gat cgc age gcg 351 
Leu Glu Pro Cys Asp Glu Ser Ser Gly Leu Tyr Cys Asp Arg Ser Ala 
80 85 90 

gac ccc age aac cag act ggc ate tgc acg gcg gta gag gga gat aac 399 
Asp Pro Ser Asn Gin Thr Gly He Cys Thr Ala Val Glu Gly Asp Asn 
95 100 105 

tgt gtg ttc gat ggg gtc ate tac cgc agt gga gag aaa ttt cag cca 447 
Cys Val Phe Asp Gly Val He Tyr Arg Ser Gly Glu Lys Phe Gin Pro 
HO 115 120 125 

age tgc aaa ttc cag tgc acc tgc aga gat ggg cag att ggc tgt gtg 495 
Ser Cys Lys Phe Gin Cys Thr Cys Arg .Asp Gly Gin He Gly Cys Val 
130 135 140 

ccc cgc tgt cag ctg gat gtg eta ctg cct gag cct aac tgc cca get 543 
Pro Arg Cys Gin Leu Asp Val Leu Leu Pro Glu Pro Asn Cys Pro Ala 
145 150 155 

cca aga aaa gtt gag gtg cct gga gag tgc tgt gaa aag tgg ate tgt 591 
Pro Arg Lys Val Glu Val Pro Gly Glu Cys Cys Glu Lys Trp He Cys 
160 165 170 

ggc cca gat gag gag gat tea ctg gga ggc ctt acc ctt gca get tac 639 
Gly Pro Asp Glu Glu Asp Ser Leu Gly Gly Leu Thr Leu Ala Ala Tyr 
175 180 185 

agg cca gaa gcc acc eta gga gta gaa gtc tct gac tea agt gtc aac 687 
Arg Pro Glu Ala Thr Leu Gly Val Glu Val Ser Asp Ser Ser Val Asn 
190 195 200 205 

tgc att gaa cag acc aca gag tgg aca gca tgc tec aag age tgt ggt 735 
Cys He Glu Gin Thr Thr Glu Trp Thr Ala Cys Ser Lys Ser Cys Gly 
210 215 " 220 

atg ggg ttc tec acc egg gtc acc aat agg aac cgt caa tgt gag atg 783 
Met Gly Phe Ser Thr Arg Val Thr Asn Arg Asn Arg Gin Cys Glu Met 
225 230 ' 235 

ctg aaa cag act egg etc tgc atg gtg egg ccc tgt gaa caa gag cca 831 
Leu Lys Gin Thr Arg Leu Cys Met Val Arg Pro Cys Glu Gin Glu Pro 
240 245 250 

gag cag cca aca gat aag aaa gga aaa aag tgt etc cgc acc aag aag 879 
Glu Gin Pro Thr Asp Lys Lys Gly Lys Lys Cys Leu Arg Thr Lys Lys 
255 260 265 

tea etc aaa gcc ate cac ctg cag ttc aag aac tgc acc age ctg cac 927 
Ser Leu Lys Ala He His Leu Gin Phe Lys Asn Cys Thr Ser Leu His 
270 275 280 285 

acc tac aag ccc agg ttc tgt ggg gtc tgc agt gat ggc cgc tgc tgc 975 
Thr Tyr Lys Pro Arg Phe Cys Gly Val Cys Ser Asp Gly Arg Cys Cys 
290 295 300 

act ccc cac aat acc aaa acc ate cag gca gag ttt cag tgc tec cca 1023 
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Thr Pro His Asn Thr Lys Thr He Gin Ala Glu Phe Gin Cys Ser Pro 
305 310 315 

ggg caa ata gtc aag aag cca gtg atg gtc att ggg acc tgc acc tgt 1071 

Gly Gin He Val Lys Lys Pro Val Met Val He Gly Thr Cys Thr Cys 
320 325 330 

cac acc aac tgt cct aag aac aat gag gcc ttc etc cag gag ctg gag 1119 

His Thr Asn Cys Pro Lys Asn Asn Glu Ala Phe Leu. Gin Glu Leu Glu 
335 340 345 

ctg aag act acc aga ggg aaa atg taacctatca ctcaagaagc acacctacag 1173 

Leu Lys Thr Thr Arg Gly Lys Met 
350 355 



agcacctgta 


gctgctgcgc 


cacccaccat 


caaaggaata 


taagaaaagt 


aatgaagaat 


1233 


cacgatttca 


tccttgaatc 


ctatgtattt 


tcctaatgtg 


atcatatgag 


gacctttcat 


1293 


atctgtcttt 


tatttaacaa 


aaaatgtaat 


taactgtaaa 


cttggaatca 


aggtaagctc 


1353 


aggatatggc 


ttaggaatga 


cttactttcc 


tgtggtttta 


ttacaaatgc 


aaatttctat 


1413 


aaatttaaga 


aaacaagtat 


ataatttact 


ttgtagactg 


tttcacattg 


cactcatcat 


1473 


attttgttgt 


gcactagtgc 


aattccaaga 


aaatatcact 


gtaatgagtc 


agtgaagtct 


1533 


agaatcatac 


ttaacatttc 


attgtacaag 


tattacaacc 


atatattgag 


gttcattggg 


1593 


aagattctct 


attggctccc 


tttttgggta 


aaccagctct 


gaacttccaa 


gctccaaatc 


1653 


caaggaaaca 


tgeagctett 


caacatgaca 


tccagagatg 


actattactt 


ttctgtttag 


1713 


ttttacacta 


ggaaacgtgt 


tgtatctaca 


gtaatgaaat 


gtttactaag 


tggactggtg 


1773 


tcataaactt 


tctccattta 


agacacattg 


actcctttcc 


aatagaaaga 


aactaaacag 


1833 


aaaactccca 


atacaaagat 


gactggtccc 


tcatagccct 


cagacattta 


tatattggaa 


1893 


getgetgagg 


cccccaagtt 


ttttaattaa 


gcagaaacag 


catattagca 


gggattctct 


1953 


catctaactg 


atgagtaaac 


tgaggcccaa 


ageacttget 


tacatcctct 


gatagctgtt 


2013 


tcaaatgtgc 


attttgtgga 


attttgagaa 


aaatagagca 


aaatcaacat 


gactggtggt 


2073 


gagagaccac 


acattttatg 


agagtttgga 


attattgtag 


acatgcccaa 


aacttatcct 


2133 


tgggccataa 


ttatgaaaac 


tcatgatcaa 


gatatatgtg 


tatacataca 


tgtatctggt 


2193 


ttgtcaggct 


acaaggtagg 


ctgeaaaatt 


aaatctagac 


attcttttaa 


tgccaccaca 


2253 


cgtgttccgc 


ttctctcttt 


taaagtattt 


ataaaaatat 


aaattgtaca 


ttttgtaaaa 


2313 


tattatgttt 


gatttctcta 


cttgtcatat 


cactaaataa 


acacgatttt 


attgetgaaa 


2373 


aaaaaaaaaa 


aaaaaa 










2389 



<210> 84 

<211> 357 

<212> PRT 

<213> NM_002514 NOV1, nephroblastoma overexpressed gene 
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<400> 84 

Met Gin Ser Val Gin Ser Thr Ser Phe Cys Leu Arg Lys Gin Cys Leu 
1 5 10 15 



Cys Leu Thr Phe Leu Leu Leu His Leu Leu Gly Gin Val Ala Ala Thr 
20 25 30 



Gin Arg Cys Pro Pro Gin Cys Pro Gly Arg Cys Pro Ala Thr Pro Pro 
35 • 40 45 



Thr Cys Ala Pro Gly Val Arg Ala Val Leu Asp Gly Cys Ser Cys Cys 
50 55 60 



Leu Val Cys Ala Arg Gin Arg Gly Glu Ser Cys Ser Asp Leu Glu Pro 
65 70 75 80 



Cys Asp Glu Ser Ser Gly Leu Tyr Cys Asp Arg Ser Ala Asp Pro Ser 
85 90 95 



Asn Gin Thr Gly lie Cys Thr Ala Val Glu Gly Asp Asn Cys Val Phe 
100 105 110 



Asp Gly Val He Tyr Arg Ser Gly Glu Lys Phe Gin Pro Ser Cys Lys 
115 120 125 



Phe Gin Cys Thr Cys Arg Asp Gly Gin He Gly Cys Val Pro Arg Cys 
130 135 140 



Gin Leu Asp Val Leu Leu Pro Glu Pro Asn Cys Pro Ala Pro Arg Lys 
145 150 155 160 



Val Glu Val Pro Gly Glu Cys Cys Glu Lys Trp He Cys Gly Pro Asp 
165 170 175 



Glu Glu Asp Ser Leu Gly Gly Leu Thr Leu Ala Ala Tyr Arg Pro Glu 
180 185 190 



Ala Thr Leu Gly Val Glu Val Ser Asp Ser Ser Val Asn Cys He Glu 
195 200 205 



Gin Thr Thr Glu Trp Thr Ala Cys Ser Lys Ser Cys Gly Met Gly Phe 
210 215 220 



Ser Thr Arg Val Thr Asn Arg Asn Arg Gin Cys Glu Met Leu Lys Gin 
225 230 235 240 



Thr Arg Leu Cys Met Val Arg Pro Cys Glu Gin Glu Pro Glu Gin Pro 
245 250 255 



Thr Asp Lys Lys Gly Lys Lys Cys Leu Arg Thr Lys Lys Ser Leu Lys 
260 " 265 270 
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Ala He His Leu Gin Phe Lys Asn Cys Thr Ser Leu His Thr Tyr Lys 
275 280 285 



Pro Arg Phe Cys Gly Val Cys Ser Asp Gly Arg Cys Cys Thr Pro His 
290 295 300 



Asn Thr Lys Thr He Gin Ala Glu Phe Gin Cys Ser Pro Gly Gin He 
305 310 315 320 



Val Lys , Lys Pro Val Met Val He Gly Thr Cys Thr Cys His Thr Asn 
325 330 335 



Cys Pro Lys Asn Asn Glu Ala Phe Leu Gin Glu Leu Glu Leu Lys Thr 
340 345 350 



Thr Arg Gly Lys Met 
355 
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Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet) 



This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following 
reasons: 

1 . [ j Claims Nos : 

because they relate to subject matter not required to be searched by this Authority, namely: 



X Claims Nos: 70-73 



□ 



because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 

The claims are not limited to the technical features of the invention as disclosed in, and supported by, 
the specification. The claims relate to methods of assessing the promoter regions of specific genes with 
respect to hyperrnethylation and ovarian cancer. However the specification only discloses sequences * 
derived from cDNA clones and does not disclose any promoter sequences. 

Claims Nos : 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 
6.4(a) 



Box II 



Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
See supplementary sheet 



I I As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims 

[" I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

|X I As only some of the required additional search fees were timely paid by the applicant, this international search 

report covers only those claims for which fees were paid, specifically claims Nos.:2-5, 7, 9, 15, 19, 20, 22, 24, 30, 

45 and 47-50 in full and claims 1, 10-12, 16-18, 25-27, 31-43, 51-53 and 60-69 as they relate to table 3. 



I I No required additional search fees were timely paid by the applicant Consequently, this international search report 
' — is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [ | The additional search fees were accompanied by the applicant's protest 

|X | No protest accompanied the payment of additional search fees. 
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Continuation of Box No: H 

The application claims more than one invention. Rule 1 3 . 1 of the PCT states the principle that an International 
Application should relate to only one invention or, if there is more than one invention, that the inclusion of those 

' inventions in one International Application is only permitted if all inventions are so linked as to form a single 
general inventive concept. 

Rule 1 3.2 of the PCT defines the method for determining whether the requirement of unity of invention is satisfied 
in respect of a group of inventions claimed in an International application. Unity of invention exists only when 
there is a technical relationship among the claimed inventions involving one or more of the same or corresponding 
"special technical features." TTie expression "special technical features" is defined in Rule 13.2 as meaning those 
technical features that define a contribution which each of the inventions, considered as a whole, makes over the 
prior art. The determination is made on the contents of the claims as interpreted in light of the description and 
drawings (if any). 

The claims and the description relate to methods for the diagnosis, prediction and treatment of ovarian cancer, using 
gene and peptide sequences whose expression is altered in ovarian cancer. In particular the claims define methods 
relating to the use of over 693 specific gene and/or peptide sequences. 

Although all of the claims share the common feature that they all relate to methods or apparatus that involve the use of 
specific gene or peptide sequences whose expression is altered in ovarian cancer,. this feature is known (see the 
documents listed below). As such this feature cannot be regarded as a special technical feature and cannot confer 
unity on the inventions relating to the use of specific gene or peptide sequences for the detection or treatment of 
ovarian cancer. 

Dl US 5 912 142 
D2 US 6 268 165 
D3 WO01 21653 

D4 Bayani et al (2002) Cancer Research 62, 3466-76 

Furthermore, there is nothing in the specification to suggest that the genes can be further divided into groups, where 
sequences within a group share a common special technical feature. " Although the specification discloses that some of 
the genes can be classified into further, narrower groups, these further divisions also cannot be regarded as special 
technical features. These divisions correspond to groups of genes that are down-regulated or up-regulated in cancers 
and genes that are associated with specific types of ovarian cancers, such as epithelial or mucinous ovarian cancer. 
However these groups are known, as are methods of using genes specifically associated with these narrower groups for 
the treatment and detection of specific subsets of ovarian cancers. See in particular D4 

Although there is a lack of unity the ISA has searched, as a service to the applicant, five sets often sequences for five 
search fees. 

These sequences comprise SEQ ID NOS 1- 7, 9, 11,13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 
46, 48, 50, 52, 53, 55, 57, 59, 60- 63, 65, 67, 69, 71-75, 77, 79, 81, 83. 
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